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TITLE OF THE INVENTION 
RECORDING, EDIT, AND PLAYBACK METHODS OF AUDIO 
INFORMATION, AND INFORMATION STORAGE MEDIUM 

CROSS-REFERENCE TO RELATED APPLICATIONS 
5 This application is based upon and claims the 

benefit of priority from the prior Japanese Patent 
Application No. 2000-048250, filed February 24, 2000, 
the entire contents of which are incorporated herein by 
reference. 

10 BACKGROUND OF THE INVENTION 

The present invention relates to recording, 
playback, and edit methods of audio information, and an 
information storage medium used in these methods. 

More specifically, the present invention relates 
15 to recording, playback, and edit methods of audio 

related information with respect to an information 
storage medium that allows sound recording (or 
recording) and playback of audio related information, 
and a data structure recorded on the information 
2 0 storage medium. 

Furthermore, the present invention relates to a 
technical field which pertains to a display method for 
displaying the contents of management information 
recorded on an information storage medium on which both 
25 playback sequence information used to sequentially play 

back information recorded on the information medium, 
and another playback sequence information that the user 



can designate are recorded as the management 
information, and to an edit method using the display 
result. 

The DVD forum issued on September of 1999 "Part 3 
VIDEO RECORDING DVD Specifications for Rewritable/ 
Re-recordable Discs" as specifications that allow 
recording and playback of video information on an 
information storage medium. 

In video information, large units such as "video 
recording units" or "titles corresponding to program 
units" which make up video contents are present. In 
the above specifications, a management unit called 
Video Object is present for "video recording unit", and 
a management unit called Program is present for 
"program unit or title". 

The DVD forum is currently examining 
specifications that allows recording/playback of audio 
information and aims at high compatibility with the 
aforementioned Video Recording specifications as Audio 
Recording specifications. 

In audio information, recording/playback is done 
using very small units called "tracks" corresponding 
to "tunes". If management information for audio 
information includes a management unit corresponding 
to "track", a new layer corresponding to "track" 
must be added to the hierarchical structure of the 
aforementioned Video Recording specifications, thus 
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impairing high compatibility with the aforementioned 
Video Recording specifications* 

BRIEF SUMMARY OF THE INVENTION 
In order to combat the above problems, it is an 
5 object of the present invention to provide a data 

structure that can easily manage in units of tracks 
unique to Audio Recording while assuring high 
compatibility with Video Recording specifications, and 
optimal recording, playback, and edit methods of audio 
10 related information to be recorded on an information 

storage medium in association with the data structure* 
The present invention is not limited to the above 
object, and has as another object to provide an "edit 
method of audio information alone which is optimal to 
15 manage in units of tracks", "edit method that combines 

audio information and still picture information which 
are optimal to manage in units of tracks", and "display 
method that makes the aforementioned edit processes 
very easy". 

2 0 in order to achieve the above objects, according 

to the present invention: 

1* PGC (Program Chain) information which is 
included in management information that pertains to 
audio information, and indicates the playback sequence 
25 can have break information of audio tracks* 

That is, PGCI can record break information of 
audio tracks . 



• 



2 . A program in original program chain 
information as management information that pertains to 
an original track can correspond to the original track. 

3. Track head entry point information indicating 
5 break information of audio tracks is recorded in cell 

information in a user-defined PGC information table as 
management information that pertains to a play list, so 
that the track head entry point can have various kinds 
of information unique to audio tracks. 

10 4. When the user designates still pictures which 

are to be displayed simultaneously upon playing back a 
given audio track, the display timings of respective 
still pictures upon playing back audio information are 
automatically computed on the basis of the playback 

15 time of the audio track and the number of designated 

still pictures, and that display timing information can 
be automatically recorded in management information. 

5. An original list and play list can be 
simultaneously displayed on the screen (the same 

20 applies not only to Audio Recording but also to Video 

Recording) . 

6. In the present invention, a new track can be 
formed on a play list by collecting some original 
tracks in an original list. Alternatively, the 

25 contents of an original track can be partially erased. 

In such case, the following mode (A) or (B) can be 
selected in accordance with the information contents of 



an original track display mode. 

(A) All still pictures displayed upon playing 
back an original track are used as those to be 
displayed upon playing back a new track on a play list 
or all still pictures displayed before partial erase 
are also displayed after partial erase. 

(B) Only still pictures , which fall within a 
specific range, of those displayed upon playing back an 
original track are used as those to be displayed upon 
playing back a new track on a play list or still 
pictures displayed within the partial erase range are 
not displayed after partial erase. 

That is, upon playback, one of these modes (A) and 
(B) can be selected to set the corresponding still 
pictures as those for a new track on the play list. 

7. An arbitrary scene of a movie object can be 
extracted as a still picture, and can be registered in 
a still -picture AV file information table as a still 
picture that can be displayed simultaneously with an 
audio object, 

8. In the present invention, still pictures are 
designated in units of tracks, and designation 
information of a representative picture which indicates 
the track contents is provided to the management 
information, and is provided independently of 
designation information of still pictures displayed 
upon playing back audio tracks. 
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9. Display range information of a representative 
audio which indicates the contents of an audio track is 
assured in an area for recording information unique to 
each audio track. 
5 Additional objects and advantages of the invention 

will be set forth in the description which follows, and 
in part will be obvious from the description, or may be 
learned by practice of the invention. The objects and 
advantages of the invention may be realized and 

10 obtained by means of the instrumentalities and 

combinations particularly pointed out hereinafter. 
BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWING 
The accompanying drawings, which are incorporated 
in and constitute a part of the specification, 

15 illustrate presently preferred embodiments of the 

invention, and together with the general description 
given above and the detailed description of the 
preferred embodiments given below, serve to explain the 
principles of the invention. 

2 0 FIG. 1 shows an example of a management data 

structure that pertains to audio track information 
according to the present invention; 

FIG. 2 shows an example of the directory structure 
of a still picture file, audio file, and text file 

25 which are associated with reproducible audio informa- 

tion recorded in an information storage medium (e.g., a 
DVD-AUDIO recording disc) according to the present 
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invention; 

FIG. 3 shows an example of a management 
information data structure that pertains to audio 
information according to the present invention; 
5 FIG. 4 shows an example of the management 

information data structure that pertains to still 
picture information recorded in the information storage 
medium according to the present invention; 

FIG. 5 shows an example of the management 
10 information data structure that pertains to text 

information recorded in the information storage medium 
according to the present invention; 

FIGS. 6A and 6B show screen images upon creating a 
play list according to the present invention; 
15 FIG. 7 shows the data structure of management 

information which is associated with the play list 
according to the present invention; 

FIG. 8 is an explanatory view showing the 
relationship between the play list and audio object 
2 0 files according to the present invention; 

FIGS. 9A and 9B are explanatory views comparing 
information contents recorded in track head entry 
points (program information) and still picture entry 
points according to the present invention; 
25 FIG. 10 is an explanatory view of a link method to 

still picture information which is associated with the 
play list according to the present invention; 



FIG. 11 is an explanatory view of a link method to 
text information which is associated in units of tracks 
according to the present invention; 

FIG* 12 is an explanatory view of a link method to 
still picture information which is associated with an 
original track according to the present invention; 

FIG. 13 is an explanatory view of a link method to 
text information which is associated with an original 
track according to the present invention; 

FIG. 14 is a block diagram showing an example of 
the block arrangement of a recording/playback apparatus 
{e.g., a DVD-AUDIO recorder /player ) according to the 
present invention ; 

FIG. 15 is a flow chart showing an example of a 
recording method of audio related information on an 
information storage medium according to the present 
invention; 

FIG. 16 is a flow chart, continued from the flow 
chart of FIG. 15, showing the remaining steps of 
FIG. 15; 

FIG. 17 is a flow chart for explaining an example 
of a partial erase method of an original track 
according to the present invention; 

FIG. 18 is a flow chart, continued from the flow 
chart of FIG. 17, showing the remaining steps of 
FIG. 17; 

FIG. 19 is a flow chart for explaining a display 
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process of a play list creation window according to the 
present invention ; 

FIG. 20 is a flow chart showing an example of a 
play list creation method according to the present 
5 invention; 

FIG. 21 is a flow chart, continued from the flow 
chart of FIG. 20, showing the remaining steps of 
FIG. 20; 

FIG. 22 is a flow chart for explaining a method of 
10 using video information as still picture information to 

be displayed simultaneously with audio information; 

FIG. 23 is a flow chart, continued from the flow 
chart of FIG. 22, showing the remaining steps of 
FIG. 22; 

15 FIG. 24 is a flow chart showing a playback 

sequence for playing back audio information in units of 
tracks ; 

FIG. 25 is a flow chart, continued from the flow 
chart of FIG. 24, showing the remaining steps of 
20 FIG. 24; 

FIG. 2 6 shows an example of the data structure of 
a part (UD_PGCIT shown in FIG. 1) of a real time 
recording audio manager (RTR_AMG); 

FIG. 27 shows an example of the data structure of 
25 a program chain information (PGC Information) contained 

in the real time recording audio manager (RTR_AMG) 
shown in FIG. 26; 



FIGS. 28A and 2 8B respectively show examples of 
entry points in an original PGC and in a user-defined 
PGC; 

FIG. 2 9 shows an example of contents of cell entry 
point information (C__EPI of type Al; C_EPI# shown in 
FIG. 27); 

FIG. 30 shows an example of the data structure of 
another part (AUDFIT shown in FIG. 3) of the real time 
recording audio manager (RTR_AMG); 

FIG. 31 shows an example of contents of an audio 
object unit entry (A0BU_ENT) corresponding to A0BU_ENT 
#n (n = integer number) shown in FIG. 30; 

FIG. 32 illustrates a concept of AOBU accesses for 
presenting contents (audio frames) of audio object 
units AOBUs; 

FIG. 33 illustrates a concept of AOBU entries 
(A0BU_ENT#) ; 

FIG. 34 shows an example of the data structure of 
still another part (ASVFIT shown in FIG. 4) of the real 
time recording audio manager (RTR_AMG); 

FIG. 35 shows an example of contents of an audio 
still video object entry (ASVOB_ENT) corresponding to 
ASVOB_ENT #n (n = integer number) shown in FIG. 34, or 
to ASV0B_ENT #1 shown in FIG. 4; 

FIG. 36 shows an example of the data structure of 
the audio still video object (ASVOB); 

FIG. 37 illustrates a concept of ASVOB accesses; 



FIG. 38 shows an example of the data structure of 
yet another part (TXTDT_MG shown in FIG. 5) of the real 
time recording audio manager (RTR_AMG); 

FIG. 39 illustrates an example of usage of primary 
text information (e. g., PRM__TXT shown in FIG. 29); 

FIG. 4 0 is an explanatory view of a presentation 
of Audio and Audio Still Video (ASVOB) ; 

FIG. 41 shows an example of the structure of an 
original PGC (ORG_PGCI shown in FIG. 26); 

FIG. 42 shows an example of the structure of a 
user-defined PGC (UD_PGCIT shown in FIG. 1 or FIG. 26); 

FIG. 43 is a view for explaining an example of an 
entry point for the representative audio; 

FIG. 44 shows an example of contents of cell entry 
point information (C_EPI of type D2 ) ; 

FIG. 45 shows an example of contents of cell entry 
point information (C__EPI of type Bl); 

FIG. 4 6 shows an example of contents of cell entry 
point information (C_EPI of type B2 ) ; 

FIG. 47 shows an example of contents of cell entry 
point information (C_EPI of type C2 ) ; 

FIG. 4 8 shows an example of contents of PGC 
general information (PGC_GI shown in FIG. 1(g) or 
FIG. 27); 

FIG. 49 shows an example of contents of program 
information (PGI# shown in FIG. 27); and 

FIG. 50 shows an example of contents of 
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representative picture information (REP_PICTI shown in 
FIG. 49). 

DETAILED DESCRIPTION OF THE INVENTION 
An embodiment of the present invention will be 
5 described in detail hereinafter with reference to the 

accompanying drawings . 

FIG. 1 shows an embodiment of the present 
invention. As shown by (a) and (b) of FIG. 1, lead-in 
area 110 , an area for volume/file configuration 

10 information 111, data area 112, and lead-out area 113 

are assured on rewritable disc-shaped information 
storage medium 100. As shown by (c) of FIG. 1, data 
area 112 as an area in which a user can record 
information in medium 100 has a format in which general 

15 computer information recording area 120 and audio/video 

related information recording area 121 can be present 
together. Audio/video contents information is called 
an Object. As shown by (d) of FIG. 1, video contents 
information is recorded in VR_movie object recording 

2 0 area 131, and audio contents information is recorded in 

AR_audio object recording area 133. 

In the embodiment of the present invention, 
simultaneously with playback of audio information, not 
only still pictures can be displayed but also Real-Time 

25 Text information that changes in synchronism with audio 

information like a word card can be simultaneously 
displayed. 
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In the embodiment of the present invention, the 
still pictures are recorded in AR_still picture object 
recording area 132 , and the Real-Time Text information 
is recorded in AR__RtText Object recording area 134. 
5 In the embodiment of the present invention, the 

audio information, still picture information, and Real- 
Time Text information are generally called "audio 
related information". The contents, attribute 
information, display control information, and the like 

10 of such object information (contents information) are 

recorded together in management information recording 
area (RTR_AMG) 130 shown in (d) of FIG* 1, 

As shown by (e) of FIG. 1, management information 
recording area 130 contains real-time audio management 

15 information ( RTR_AMGI ; audio general information such 

as an attribute and the like) 140, movie AV file 
information table (M_AVFIT; information such as a movie 
recording position and the like) 141, still picture AV 
file information table (S_AVFIT; information such as a 

20 still picture recording position and the like) 142, 

audio AV file information table (A_AVFIT; information 
such as an audio recording position and the like) 143, 
original PGC information (ORG__PGCI) 144, user-defined 
PGC information (UD__PGCI) 145, text data manager 

25 (TXTDT__MG) 14 6, and manufacture information table 

(MNFIT) 14 7. 

One and only original PGC is present in 
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information storage medium 100, and a plurality of 
user-defined PGCs can be set. As shown by (f) of 
FIG, 1, management information which pertains to such 
user-defined PGC is recorded in first user-defined PGC 
5 Information #1/156 to m-th user-defined PGC information 

#m/157. 

These pieces of information are managed together 
in user-defined PGC information table 145. More 
specifically, user-defined PGC information table 

10 information (UD_PGCITI) 150 indicates tables recorded 

in this table. In order to search for PGC information, 
user-defined PGC information (UD_PGCI) search pointers 
151 and 152 are recorded. 

Each object information (contents information) 

15 mentioned above is recorded in an independent file in 

units of object contents. 

More specifically, as shown in FIG. 2, all pieces 
of audio information are recorded together in file 
AR_AUDIO.ARO 221, all pieces of still picture 

20 information are recorded together in file AR_STILL. ARO 

213, all pieces of Real-Time Text information are 
recorded together in file AR_RT_TEXT . ARO 222. 

In the embodiment of the present invention, one 
scene of a video in a video information file defined on 

25 the Video Recording specifications is extracted as a 

still picture, and is displayed simultaneously with 
audio information. Video information file VR MOVIE. VRO 
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212 used at that time is also recorded in single 
DVD__RTAV directory 210. Information in management 
information recording area 130 that systematically 
manages these object files is recorded in file 
5 AR__MANGR.IFO 211 and its backup file AR_MANGR.BUP 215. 

The framework of the data structure of management 
information which is recorded on the information 
storage medium that allows sound recording (or 
recording) and playback of audio related information 

10 has the same structure as that in the Video Recording 

specifications constituted by the DVD forum for the 
sake of compatibility , as shown in (e) of FIG. 1. 

As in the specifications "Part 3 VIDEO RECORDING 
DVD Specifications for Rewritable /Re-recordable Discs" 

15 constituted by the DVD forum on September of 1999, 

information indicating the playback sequence of audio 
related information is recorded in PGC (Program Chain) 
information 144 (original program chain) and PGC 
information 145 (user-defined program chain). 

2 0 That is, upon playback, minimum basic units to be 

continuously played back in audio related information 
are called cells, and a PGC (Program Chain) is formed 
as a playback sequence indicating a linkage of the 
cells. 

25 All pieces of management information that pertain 

to cells are recorded in first cell information #1/164 
to sixth cell information #6/169 ((g) of FIG. 1 and (d) 
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of FIG. 8). The portion (i) of FIG. 1 shows the 
presence of audio tracks 1, 2, and 3 as a result of 
editing , and (h) of FIG. 1 shows that track head entry 
points (THEP; or C_EPI) 171, 172, and 173 that 
5 designate the heads of audio tracks are recorded. Each 

of these track head entry points (THEP; C_EPI) 171, 172, 
and 173 designates cell information for playing back 
objects of the corresponding track. 

The data structure of management information which 

10 pertains to still picture information (still picture 

object) to be simultaneously displayed upon playing 
back audio information, and text information indicating 
unique information in units of tracks will be explained 
below using FIGS. 3 to 5. 

15 The contents of (a) to (e) of FIG. 1 may 

respectively be identical to those of (a) to (e) of 
each of FIGS. 3 to 5. 

Management information which pertains to audio 
information in file AR_AUDIO.ARO 221 shown in FIG. 2 is 

20 recorded in audio AV file information table (AUDFIT) 

143, as shown in (e) of FIG. 3. Note that the same 
reference numerals in FIG. 3 denote the corresponding 
portions in FIG. 1. 

The contents of (e) to (i) of FIG. 3 

25 hierarchically show A_AVFIT (audio AV file information 

table) 143, i.e., management information that pertains 
to audio. As shown in (f ) of FIG. 3, audio AV file 
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information table 143 is comprised of A_AVFIT 
information (AUD_FITI) 180, a plurality of pieces of 
audio object stream information (AUD_STI) 181, . 182, 
audio AV file information (AUDFI) 184, real-time text 
5 object stream information 186, and real-time text AV 

file information 189. As shown in (g) of FIG* 3, audio 
AV file information 184 is formed of audio AV file 
general information (AUDFI_GI) 190, audio object 
information search pointers (AOBI__SRP) 191, . .., 192, a 

10 plurality of pieces of audio object information (AOBI) 

196, . .., 197, and the like. 

As shown in (h) of FIG. 3, each audio object 
information is formed of audio object general 
information (AOB_GI/AOBU__GI ) 24 0, and audio object unit 

15 entries (AOBU_ENT) 241, , 248. As shown in (i) of 

FIG. 3, each audio object unit entry is made up of 
audio object data size (AOBU_SZ) 251, audio object unit 
presentation time (indicating, e.g., one second 
corresponding to AOBU_SZ) 252, real-time text position 

20 (indicated by, e.g., difference address) 253, and the 

like . 

Management information for real-time text 
information (Real-Time Text Object) (information, the 
display contents of which change in synchronism with 
25 audio information) recorded in file AR__RT_TEXT . ARO 222 

in FIG. 2 is also recorded in Real-Time Text Object 
Stream Information #1/186 ((f) of FIG. 3) and real-time 



text AV file information 189 ((f) of FIG. 3) in audio 
AV file information table 143 ((e) of FIG. 3). 

Upon recording audio information on disc-shaped 
information storage medium 100, a plurality of tracks 
are often recorded together* In this case, an audio 
information unit recorded at one time is called an 
audio object (AOB). 

Management information, i.e., audio object 
information (AOBI #1/196 to AOBI #i/197 in (g) of 
FIG. 3) is provided to each AOBs . In order to allow 
special playback of audio information such as Fast 
Front, Fast Reverse, time search, and the like, audio 
information is broken up (or divided) into units (audio 
object units) smaller than the AOB, and a plurality of 
sets of information of data sizes (audio object unit 
data size (AOBU_SZ) 251 in (i) of FIG. 3) and display 
times (audio object unit presentation time 252) of the 
respective units (audio object units) are recorded in 
the recording locations of audio object unit entries 
A0BU_ENT #1/241 to AOBU_ENT #h/248 shown in (h) of 
FIG. 3. 

In the embodiment of the present invention, 
position (relative address) information in file 
AR_RT_TEXT . ARO 222 where real-time text information 
(Real-Time Text Object) to be displayed upon playing 
back audio information at the head position in each 
unit (audio object unit) is recorded is recorded in a 



* 



corresponding one of audio object unit entries AOBU_ENT 
#1/241 to AOBU_ENT #h/248 ((h) of FIG. 3) as Real-Time 
Text Position information 253 ((i) of FIG. 3). 

Note that a program chain (PGC) is a generic 
5 conceptual unit to represent a chain of track which 

corresponds to the track set, and to represent a chain 
of part of track which corresponds to a play list. 

An original PGC (ORG_PGC) represents the track set 
which is a chain of tracks, and includes stream data 
10 stored in 11 . ARO " files (cf. FIG. 2). Only one 

original PGC shall exist in disc 100. 

A user defined PGC (UD_PGC) is a chain of part of 
tracks. The UD_PGC contains only navigation data, and 
each part of track refers to stream data belonging to 
15 the ORG_PGC. Therefore, creating or deleting any 

UD_PGC does not affect the ORG_PGC at all. 

An audio object (AOB) is audio stream data 
originated in one real time recording. 

A basic unit of the AOB is called an audio object 
2 0 unit (AOBU) which is formed of one or more audio frames 

and padding data. One audio frame shall not be 
included in two AOBUs. Padding data shall not exist in 
the middle of AOBU but may exist at the end of AOBU. 

In case of Linear PCM, however, padding data may 
25 exist in the middle of AOBU to align the unit of sample 

data into the boundary of the data pack of the AOBU. 
For the purpose of this alignment, the size of padding 
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data in the middle of AOBU shall be smaller than the 
size of the unit of sample data* 

In short, a data pack of AOBU can be filled with 
the padding data so that the pack does not cross from 
5 one sample data unit of Linear PCM to the next sample 

data unit thereof. 

The presentation time period of AOBU has a fix 
value according to coding and sampling frequency of the 
audio elementary stream, 
10 For example, when coding is Linear PCM and the 

sampling frequency is any of 4 8 kHz, 96 kHz, and 
192 kHz, the presentation period of AOBU is just 1 
second, or this AOBU has the size corresponding to the 
presentation time of 1 second. 
15 An audio still video object (ASVOB) is audio still 

video stream data played back with presentation of AOB. 
ASVOB represents one still picture. 

An audio still video unit (ASVU) is a collection 
of one or more {up to 99) ASVOB(s) which are presented 
2 0 while one or more tracks are played back. ASVU may be 

pre-loaded into a memory (buffer) before starting the 
presentation of the track(s). 

A program (PG) is, from a user's point of view, a 
data structure corresponding to an original track. The 
25 PG is formed of one or more cells. 

A cell is a data structure to represent a portion 
of a track. A cell in the original PGC is called an 



original cell, and a cell in the user defined PGC is 
called a user defined cell. A track in the track set 
is formed of one or more original cells, A part of 
track in a play list is formed of one or more user 
defined cells. The cell refers to a whole or a part of 
an AOB. 

An entry point (EP) is data to specify the 
playback behavior within a cell. There are four types 
of entry point (EP for a user defined track, EP for an 
index, EP for a display list, and EP for a representa- 
tive audio). Each cell has a set of entry points. 

A program chain information (PGCI) is a data 
structure to represent a total presentation of a PGC. 
The PGCI is used both for the original PGC and user 
defined PGC. The user defined PGC has only PGCI, and 
the cells in the PGCI refer to AOBs in the original PGC. 
The total presentation of a PGC is described as a 
presentation sequence of cells defined in the PGCI. 

An audio object information (AOBI) is a data 
structure to describe information regarding an AOB. 

An audio still video unit information (ASVUI) is a 
data structure to describe information regarding an 
ASVU. 

As shown in (f) of FIG. 4, still picture AV file 
information table 142 is formed of A__AVFIT information 
(ASVFITI) 260, one or more pieces of still picture VOB 
stream information (ASVJ5TI) 261-262, and still picture 
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AV file information ( S_AVFI/ASVFI ) 264. 

As shown in (g) of FIG. 4, still picture AV file 
information 264 is formed of S_AVFI general information 
(ASVFI_GI) 270/ one or more S_VOGI search pointers 
5 (ASVUI_SRP) 271-272, and one or more pieces of still 

picture VOB group information (ASVUI) 273-279. 

As shown in (h) of FIG. 4, still picture VOB group 
information 273 is formed of still picture VOB group 
general information (ASVU_GI) 280, and one or more 
10 still picture VOB entries (ASVOB_ENT) 281-289. 

As shown in (i) of FIG. 4, still picture VOB entry 
( AS VOB_ENT ) 281 includes information of still picture 
VOB entry type ( ASVOB_ENT_TY ) 291, and the size of one 
still picture (or size of the corresponding video part) 
15 (ASVOB_SZ) 292. 

A plurality of pieces of still picture information 
are often recorded together on disc-shaped information 
storage medium 100. Therefore, a plurality of pieces 
of still picture information to be recorded together 
2 0 are called a still picture VOB group, and management 

information associated with each still picture VOB 
group is recorded in corresponding still picture VOB 
group information (ASVUI #1/273 to ASVUI #g/279 shown 
in (g) of FIG. 4) so as to manage in units of still 
25 picture VOB groups. 

Still picture VOB entries ASVOB_ENT #1/281 to 
ASVOB_ENT #f/289 ((h) of FIG. 4) in still picture VOB 



group information ASVUI #1/273 to still picture VOB 
group information ASVUI #g/279 are used to manage 
the data size per still picture (one still picture 
size 292), 

A plurality of pieces of item text information are 
recorded together in text data manager (TXTDT_MG) 146 
shown in (e) of FIG. 5. 

As shown in (f) of FIG. 5, text data manager 146 
is formed of text data information (TXTDTI) 231 f one or 
more item text search pointers ( IT_TXT_SRP) 232-233, 
and one or more item texts {IT_TXT) 236-238. As shown 
in (g) of FIG. 5, each item text (IT_TXT) 239 includes 
general text information. 

In this way, all the item text contents (IT__TXT 
#1/236 to IT_TXT #e/238 shown in (f) of FIG. 5) can 
undergo a search such as "text search" to help audio 
information search . 

Note that the present specification hierarchically 
describes various data structures recorded on a storage 
medium (100), and these structures are described on a 
plurality of figures from various directions, but the 
same reference numerals denote the corresponding parts 
throughout the figures. 

The playback sequence information that pertains to 
audio related information includes two types of 
sequences : 

la. a playback sequence that plays back in the 



order in which data was recorded on information storage 
medium 100; and 

2a. a playback sequence that the user can 
arbitrarily designate . 

lb. Management information which pertains to the 
playback sequence that plays back in the order data was 
recorded on information storage medium is called 
"original PGC", and is named "original track 1" for the 
user, as shown in FIG. 6A. 

2b. Management information which pertains to the 
playback sequence that the user can arbitrarily 
designate is called "user-defined PGC", and is named 
"play list" for the user, as shown in FIG . 6B. 

A CD (Compact Disk), MD (Magneto-Optical Disk), 
and cassette tape have management units called tracks 
which are set in units of tunes of popular music or in 
units of movements of classical music. Upon creating 
the play list (user-defined PGC), the user may often 
create new track "C" by combining portions of original 
tracks "A" and "B" . 

A column of picture 5 in FIG. 6A indicates 
"representative pictures" as still pictures indicating 
the contents of individual tracks . In the embodiment 
of the present invention, a still picture which is 
displayed first upon playing back audio information is 
often used as a representative picture. However, the 
present invention is not limited to such specific 
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picture, and a representative picture can be designated 
independently of a still picture which is displayed 
first. Display mode 7 indicates the way still pictures 
to be displayed upon playing back audio information are 
5 switched, and their timings* 

* Display order modes include: 

Sequential,., still pictures to be displayed are 
switched in accordance with an order designated in 
advance; 

10 Random. . . display order of still pictures is 

randomly set (an identical picture may often be 
successively displayed in the worst case); and 

Shuffle... display order of still pictures is 
rearranged with respect to an order designated in 

15 advance to display pictures (an identical still picture 

is displayed once per cycle). 

* Display timing modes include: 

Slideshow. . . a still picture is switched to the 
next one at a predetermined timing; and 
20 Browsable... a still picture is switched to the 

next one when the user has pressed a switch (an 
identical still picture is kept displayed until the 
user presses the switch). 

Display mode 7 is set in units of tracks, and 
25 never changes within a given track. 

Time chart 11 visualizes a designation range when 
the user designates a portion of an original track upon 



creating a play list* 

Note that an "original track" is a logical unit of 
contents which are consecutively recorded. An original 
track may correspond to one track when it is copied 
from a digital source (such as CD or DVD disc), and may 
correspond to a song (or tune) when it is recorded from 
an analog source (e.g., recorded from a microphone or 
via broadcasting) . When part of a track is deleted, 
although the total presentation time of the original 
track is decreased, the original track remains. When 
an original track is modified or created as a result of 
editing the recorded contents, the original track is 
defined as a logical unit which is consecutively 
presented. 

The entire recorded contents of a disc (100) 
consisting of all tracks are represented by a "track 
set". The track set corresponds to a model which 
abstracts sequential media such as audio tapes . 
Therefore, the presentation of the track set needs to 
be defined to simulate the sequential media. When the 
track set is played back, the presentation order of 
original tracks becomes the same as the recorded order 
of the original tracks, unless any original tracks have 
been edited so as to change the presentation order from 
the original recording. When an original track or part 
of an original track is deleted, the track set remains 
although the total presentation time is decreased. 



When a new original track is recorded, it is appended 
at the end of the track set. The track set corresponds 
to the data structure called an original PGC. 

A specified segment of a track is called as an 
"index". Assume that a track corresponds to one 
symphony. Under this assumption, one movement of the 
symphony corresponds to an index. The start point of 
an index segment is indicated by an index point. An 
index point will be automatically set by an apparatus 
(or equipment) in recording time using given 
information such as index data of source contents f or 
set by user operation such as pausing or stopping of 
recording. The index may be inherited from the 
original track in the track set or may be defined in a 
play list. 

A subunit of recorded contents within an original 
track is called as "part of track" . A part of track is 
a consecutive part of an original track which is 
specified by a user. This abstraction is used only to 
define the play list itself. Therefore, there is no 
data structure directly representing the part of track. 

A list of part of tracks is called as a "play 
list". A play list allows a user to define any 
playback sequences, each of which may be a filtered 
view of the track set. A play list is defined as a 
user defined PGC. 

A user defined PGC is a chain of part of programs. 
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A program is a logical unit of recorded contents, which 
are recognized or defined by a user. A program is 
formed of one or more original cells, and is defined 
only in the original PGC. The user defined PGC 
5 contains only navigation data, and each part of program 

refers to stream data belonging to the original PGC. 
Therefore, creating or deleting a user defined PGC does 
not affect the original PGC at all. 

A specified segment in a play list is called as an 

10 "user defined track" . A user defined track is a user 

specified segment in the play list. A user can add 
multiple part of track to the play list* Not all part 
of track will be the beginning of a song (or tune) from 
a user's point of view. Therefore, a user is allowed 

15 to define the beginning of a song (or tune) at the top 

of specified part of track. 

FIG. 7 shows the relationship between management 
information that pertains to the created play lists, 
and the original PGC, and FIG. 8 shows the relationship 

20 between the management information, and audio object 

file AR_AUDIO.ARO 221 shown in FIG. 2. 

See the upper portion in the illustration of 
FIG. 7. Data (ASVOBs) reproduced from disc 100 is 
temporarily stored in a temporary storage (ASVU buffer) 

25 As shown in (a) of FIG. 7, the data in the ASVU buffer 

can include a plurality of still pictures (No. 1 to 
No. 9). In the example shown in (b) of FIG. 7, still 



pictures No. 1 to No. 4 pertain to audio track No. 1, 
still pictures No. 5 and No. 6 pertain to audio track 
No. 2, and still pictures No. 7 to No. 9 pertain to 
audio track No. 3. For instance , during the 
presentation of audio track No. 1, still pictures No. 1 
to No. 4 can be displayed at given timings. 

As shown by (c) of FIG. 7 , audio track No. 1 is 
associated with track head entry point (C_JEPI) #1/171 
and still picture entry points 21 to 23. Similarly, 
audio track No. 2 is associated with track head entry 
point (C_EPI) #2/172 and still picture entry point 24 , 
and audio track No. 3 is associated with track head 
entry point (C_EPI) #3/173 and still picture entry 
points 25 and 26. 

As shown by (d) of FIG. 7 , track head entry points 
(C_EPI) #1/171, #1/171, and #3/173 can indicate cell 
information (CI) #1/164, #4/167, and #5/168, respec- 
tively. Cell information (CI) #2/165 can be indicated 
by still picture entry points 21 and 22. Cell 
information (CI) #4/167 can be indicated by still 
picture entry point 24. Cell information (CI) #6/169 
can be indicated by still picture entry points 25 
and 26. 

The portion (e) of FIG. 7 shows audio objects (AOB 
#1 to AOB #5) to explain the concept, and audio object 
entries #1 to #5 are recorded in audio object 
information (AOBI #1/196 to AOBI #i/197 in (g) of 
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FIG. 3) which records management information that 
pertains to each AOB . 

The portion (h) of FIG. 7 shows original program 
chain 32 0. The portion (g) of FIG . 7 shows first (#1) 
5 program information, second (#2) program information, 

third (#3) program information, and fourth (#4) program 
information corresponding to this program chain, and 
(f) of FIG. 7 shows a plurality of pieces of original 
cell information 301, 302, 303, 304, and 305 corre- 

10 sponding to this program information. These pieces of 

original cell information respectively correspond to 
tunes (objects) "Automatic", "First Love", "In My Room", 
and "Another Chance" in (e) of FIG. 7. 

The portion (d) of FIG. 7 shows a case wherein 

15 first cell information #1/164 designates a portion of 

audio information with a track name (tune name) 
"Automatic", second cell information #2/165 and third 
cell information #3/166 designate audio information 
with a track name (tune name) "Another Chance", and 

20 fourth cell information #4/167 designates audio 

information with a track name (tune name) "In My Room" 
upon editing. In this case, upon playing back in 
accordance with the order these cell information #1/164 
to cell information #6/169 are arranged, after a 

25 portion of "Automatic" is played back/displayed, 

"Another Chance" and "In My Room" are played 
back/displayed in turn. 
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The contents of (a) to (d) of FIG. 8 may be the 
same as those of FIG. 7. 

Information indicating that range in file 
AR_AUDI0.AR0 221 (FIG. 2 or (f) of FIG. 8), which is to 
5 be played back by a single cell, is recorded in cell 

information (#1/164 to #6/169) ((d) of FIG. 8). The 
playback sequence of audio related information to be 
played back in accordance with one PGC information is 
set to play back/display in the order cell information 

10 #1/164 to cell information #6/169 which form that PGC 

information 156 are arranged, as shown in (g) of FIG. 1, 

For example, as shown in (d) of FIG. 8, when cell 
information #1/164 designates a portion of audio 
information with a track name (tune name) "Automatic", 

15 cell information #2/165 and cell information #3/166 

designate audio information with a track name (tune 
name) "Another Chance", and cell information #4/167 
designates audio information with a track name (tune 
name) "In My Room", "Automatic" is partially played 

20 back/displayed and, after that, "Another Chance" and 

"In My Room" are played back/displayed in turn in 
accordance with the order cell information #1/164 to 
cell information #6/169 are arranged. 

As shown in (e) of FIG. 8, in the embodiment of 

25 the present invention, since one cell can designate 

only a continuous playback range in AR_AUDIO.ARO 221 
((f) of FIG. 8) as an audio information file (in other 



32 - 



words, stepped (discrete) playback ranges in 
AR_AUDIO.ARO 221 cannot be played back), the present 
invention is characterized in that a portion of 
original track "A" is designated as one (user-defined) 
5 cell #1 (first cell), and a portion of original track 

"B" is designated as another (user-defined) cell #2 
(second cell) to define and manage new track "C" as a 
combination of these cells #1 and #2. 

Therefore, the embodiment of the present invention 

10 adopts a data structure in which one track is formed by 

a combination of one or more cells. 

As shown in (d) of FIG . 8, and as described above, 
each cell information (#1/164 to #6/169) records an AOB 
(audio object) indicated by that cell and the cell 

15 start and end times as time information. 

Upon playing back the designated cell, information 
within the designated time range in the designated AOB 
is played back. Using information of audio object unit 
entries A0BU_ENT #1/241 to AOBU__ENT #h/248 ((h) of 

20 FIG. 3) recorded in audio object information (AOBI 

#1/196 to AOBI #i/197) ((g) of FIG. 3), this time 
information is converted into a relative address in 
AR_AUDI0.AR0 221 ((f) of FIG- 8) to play back desired 
audio information . 

25 Note that playback start can be arbitrarily 

selected. When the user designates one of track head 
entry points 171, 172, and 173, playback can be started 



from any track (edited tune) "Automatic" + "Another 
Chance", "In My Room", or "First Love" + "Another 
Chance". In this case, edited tracks Nos . 1, 2, and 3 
are exemplified. Still pictures can also be designated 
in association with audio tracks. 

Management information for still picture 
information (still picture object) to be displayed 
simultaneously with playback of audio information is 
recorded in still picture AV file information table 
(S_AVFIT) 142 shown in (e) of FIG. 4. 

Cell information (CI) #1/164 to cell information 
(CI) #6/169 correspond to portions of original tracks, 
and include still picture entry points 21 to 2 6 ((c) of 
FIG. 7 or (c) of FIG. 8) that record management 
information which pertains to the second and subsequent 
still pictures to be displayed in corresponding tracks. 
Each cell information (each of CI #1/164 to CI #6/169) 
records "designation information of corresponding audio 
object information (AOBI #1/196 to AOBI #i/197 in (g) 
of FIG. 3)" and "information that pertains to the start 
and end times of each object", and the access address 
on AR_AUDIO.ARO 221 can be detected with reference to 
audio object entries #1 to #5 in (e) of FIG. 7 (audio 
object unit entries AOBU_ENT #1/241 to AOBUJSNT #h/248 
in (h) of FIG. 3) in corresponding audio object 
information (AOBI #1/196 to AOBI #i/197 in (g) of 
FIG. 3). 
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As a method of defining breaks of tracks unique to 
audio information while maintaining the aforementioned 
data structure so as to assure compatibility with the 
Video Recording specifications constituted by the DVD 
5 forum, the first major feature of the embodiment of the 

present invention lies in that "information indicating 
a break position of each track with respect to the 
audio information is recorded in PGC information as 
information indicating the playback sequence." 

10 As a method of realizing this, in the embodiment 

shown in FIG, 1, information recording areas called 
first (#1), second (#2), and third (#3) track head 
entry points (or cell entry points C_EPI #1 to C_EPI 
#3) 171, 172, and 173 are set in cell information CI 

15 #1/164, cell information CI #4/167, and cell 

information CI #5/168 as management information of 
cells which are located at the playback start positions 
of the individual tracks, so as to record information 
unique to tracks shown in FIGS . 9A and 9B. 

20 The contents recorded in track entry points C_EPI 

#1/171 to C_EPI #3/173 shown in (h) of FIG. 1 and still 
picture entry points 21 to 2 6 shown in (c) of FIG. 7 or 
(c) of FIG. 8 will be explained below using FIGS. 9A 
and 9B. "Information for designating the saving 

25 location of a still picture to be displayed" designates 

a corresponding still picture using number designation 
information of still picture VOB group information 
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(ASVUI #1/273 to ASVUI #g/279 in (g) of FIG. 4) and a 
still picture VOB entry (ASVOB__ENT #1/281 to ASVOB_ENT 
#f/289 in (h) of FIG. 4) therein. 

As shown in FIGS. 9A and 9B, as the types of entry 
5 points, a track head entry point ((h) of FIG. 1 or (d) 

of FIG. 11) or program information ((g) of FIG. 7) is 
available. The contents of entry point information 
(C_EPI) of this entry point include entry point type 
information (EP_TY in FIG. 29, etc.; identification 

10 information indicating a track head entry point or 

still picture entry point), information (RA_DUR in 
FIG. 44) for designating the display range of a 
representative audio that indicates the contents of the 
corresponding audio track (designated by the playback 

15 start and end times in the corresponding audio track), 

and information (REP_PICTI in FIG. 2 9 or FIG. 50) for 
designating the saving location of a representative 
picture that represents the contents of the 
corresponding audio track [designated by an S_VOB 

20 search pointer number (still picture VOB group number) 

and a VOB entry number therein]. 

Furthermore, the contents of entry point 
information include text information (primary text 
information PRM_TXTI: tune name, player name, singer 

25 name, song writer name, or the like) unique to the 

corresponding audio track, additional comment text 
information (central text information; item text 



IT_TXT), a display mode (display order mode, display 
timing mode) of still pictures in the corresponding 
audio track, display time range information of the 
corresponding still picture, the relationship between 
5 the still picture contents to be displayed and an 

original track (whether the same still pictures as 
those in the original track are displayed or other 
unique (newly set) still pictures are displayed), an 
erase inhibition flag, and the like. 

10 The still picture entry point contains entry point 

type information (identification information indicating 
a track head entry point or still picture entry point), 
information for designating the saving location of a 
still picture to be displayed [designated by an S_VOGI 

15 search pointer number (still picture VOB group number) 

and a VOB entry number therein], designation 
information of the display timing of the still picture 
of interest (to adjust the display timing between two 
objects by designating display time information of the 

20 corresponding audio object), display time range 

information of the corresponding still picture 
information, and the like* Note that other kinds of 
information may be added in addition to those described 
above . 

25 The present invention is not limited to the 

specific embodiment shown in FIG . 1. In place of track 
head entry points (C_EPI) 171 to 173 ((h) of FIG. 1), a 
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recording area of "flag information indicating if the 
corresponding cell is located at the head position of a 
track" (if "flag = 1", the corresponding cell is 
located at the head position of a track; if "flag = 0", 
5 the corresponding cell is located at the second or 

subsequent position of a track) may be set in cell 
information #1/164, #4/167, or #5/168 of a cell which 
is located at the head position of a track (although 
not shown). Furthermore, the embodiment of the present 

10 invention includes a method of recording information 

with the contents shown in FIGS . 9A and 9B in a general 
information recording area of each cell information 
#1/164, #4/167, or #5/168 ((g) of FIG. 1). 

The display window of original track 1 in FIG. 6A 

15 and the data structure of corresponding management data 

will be described below using FIG. 7. All pieces of 
information which pertain to original track 1 in 
FIG. 6A are managed in original program chain 
information 144. Original PGC 320 shown in (h) of 

20 FIG. 7 depicts the concept of such structure. As for 

the original track, each program corresponds to one 
track, i.e., one original track = one program. 
Information that pertains to each original track shown 
in FIGS. 9A and 9B is described in program information 

25 (#1/311 to #5/305) as a management information 

recording area of this program. 

One program consists of one or more original cells, 



and management information recording areas of original 
cell information #1/301 to original cell information 
#5/305 are assured for such original cells. In 
original track 1 (original PGC 320), AOBs #1 to #5 have 
one-to-one correspondence with all original cells 
#1/301 to #5/305, i.e., one cell = one AOB . 

The correspondence among FIGS . 6A and 6B, FIG. 7, 
and FIG. 8 will be explained below. Audio information 
with a track name (tune name) "Automatic" in FIG. 6A is 
recorded in AOB #1 in file AR__AUDI0 . ARO 221, management 
information that pertains to the original track is 
recorded in program information #1/311, and information 
that pertains to playback is recorded in original cell 
information #1/301. Likewise, audio information with a 
track name (tune name) "First Love" is recorded in AOB 
#2 in file AR^AUDIO.ARO 221, management information 
that pertains to the original track is recorded in 
program information #2/312, and information that 
pertains to playback is recorded in original cell 
information #2/302. Audio information with a track 
name (tune name) "Another Chance" is managed as one AOB 
immediately after recording, but since its track is 
partially erased, that audio information is broken up 
into two AOBs #4 and #5, and its cell information is 
broken up into two pieces of original cell information 
#4/304 and #5/305 accordingly ((e) of FIG . 7). However, 
since the track itself remains unchanged, program 



information #4/314 is maintained as a piece of 
information. 

The user creates a new track in a play list he or 
she wants by an edit process using the window shown in 
FIG. 6B* For example, assume that the user creates 
play lists #1 and #2, as shown in FIG. 6B. That is, 
the user creates new track No. 1 by joining the range 
from A to B of "Automatic" and whole "Another Chance" , 
and sets four still pictures No. 1 to No. 4 shown in 
(a) of FIG. 7 as those to be displayed during playback 
of this music. Then, the user creates new track No. 2 
by changing still pictures to be displayed of "In My 
Room", and creates new track No. 3 by joining the range 
from A to B of "First Love" and the range from C to D 
of "Automatic" and setting three still pictures No. 7 
to No. 9. 

FIG. 10 depicts the designation method of still 
picture information. The contents of (a) to (d) of 
FIG. 10 may be the same as those of FIG. 7. 

In fig. 10, all still pictures are recorded 
together in still picture object file AR_STILL . ARO 213 
((f) of FIG. 10) in units of still picture VOB groups 
(ASVU in (f) of FIG. 10 or in (e) of FIG. 10) #1 to #g, 
and management information of each still picture is 
recorded in a still picture VOB entry ((e) of FIG. 10) 
(ASVOB_ent# 281 to 299) in still picture VOB group 
information (ASVUI #1/273 to ASVU #g/279 in (e) of 
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FIG. 10). Cell information CI ((c) of FIG. 10) can 
refer to still picture VOB groups (ASVU in (e) and (f) 
of FIG. 10) via various entry points ((d) of FIG. 10). 

Meanwhile, movie object file VR_MOVIE.VRO 212 ((h) 
5 of FIG. 10) can refer to a still picture VOB group 

(e.g., ASVU #g in (f) of FIG. 10) via given still 
picture VOB entries (e.g., #g-l, #g-2 in (g) of 
FIG. 10). 

"Information for designating the saving location 

10 of a still picture to be displayed" in FIGS. 9A and 9B 

corresponds to each of "solid arrows" extending from 
track head entry points (C__EPI) #1/171 to #3/173 and 
still picture entry points 21 to 2 6 ((d) of FIG. 10) to 
still picture VOB entries 281 to 299 ( AS VOB__ENT ) in (e) 

15 of FIG. 10. 

"Information for designating the display timing of 
a still picture" designated in FIGS. 9A and 9B is 
timing designation information which is enabled when 
"Sequential Mode" is designated as the display order 

20 mode, and "Slideshow" as the display timing mode, and 

means time information at which display is switched to 
a still picture designated by one of still picture 
entry points 21 to 26 corresponding to the already 
displayed still picture. 

25 In the display mode, the designated still picture 

is kept displayed until 

the time designated by the next still picture 



entry point (21 to 26), or time at which the 
corresponding track comes to an end. 

In the present invention, the display switching 
time information is expressed by presentation time 
5 information of audio information. However, the 

present invention is not limited to such specific 
information. For example, differential time 
information from the playback start time of the 
corresponding track to the display switching time of 

10 the designated still picture may be used. "Display 

time range information of the corresponding still 
picture" is enabled when "Brows able Mode" is designated 
as the display timing mode. 

When the user has pressed a switch, a still 

15 picture displayed so far is switched to the one 

designated by the still picture entry point (21 to 26). 
After that, if the user does not press the switch, an 
identical still picture is kept displayed until the 
corresponding track comes to an end. 

20 When a maximum display time is designated by the 

"display time range information of the corresponding 
still picture", if the user does not press the 
changeover switch of still pictures until that time, 
display of the corresponding still picture is 

25 automatically stopped, and the screen is automatically 

switched to a "blue back". 

Conversely, when the user has inadvertently 



pressed the changeover switch continuously, still 
pictures are quickly switched in turn, and the user 
cannot watch still pictures at ease. 

When a minimum display time is set by the "display 
time range information of the corresponding still 
picture", even when the user continuously presses the 
changeover switch, the still picture to be displayed is 
inhibited from being switched for the set minimum 
display time. 

Since a still picture designated by the track head 
entry point (171 to 173) or program information (311 to 
314) is displayed simultaneously with the beginning of 
playback of the corresponding track, the need for 
"information for designating the display timing of a 
still picture" can be obviated. In the present 
invention, a representative picture of each track can 
be independently set by "information for designating 
the saving location of a representative picture that 
represents the contents of the corresponding audio 
track" in addition to display simultaneously with the 
beginning of playback of the corresponding track. 

Still pictures designated by the "information for 
designating the saving location of a representative 
picture that represents the contents of the corre- 
sponding audio track" shown in FIGS. 9A and 9B 
correspond to those shown in the columns of pictures 5 
and 6 in FIGS. 6 A and 6B. On the other hand, the 
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"information for designating the saving location of a 
representative picture that represents the contents of 
the corresponding audio track" shown in FIGS. 9A and 9B 
corresponds to each of "broken arrows" extending from 
track head entry points #1/171 to #3/173 and still 
picture entry points 21 to 26 to still picture V0B 
entries 281 to 299 in FIG. 10. 

As described above, since display mode 7 of still 
pictures is set in units of tracks, and never changes 
within a given track, it is recorded in the track head 
entry point (171 to 173) or program information (311 to 
314) as information unique to that track. Display 
modes 7 and 8 in FIGS. 6A and 6B show the contents set 
by "display mode of still pictures in the corresponding 
audio track" in FIGS. 9A and 9B. 

The contents of "text information (Primary Text 
information) unique to the corresponding audio track" 
in FIGS. 9A and 9B correspond to primary text 
information (51 to 53) in (e) of FIG. 11 mentioned 
above, and "tune name" information in that information 
is shown in the column of "track title 3" in FIG. 6A. 
"Additional comment text information" in FIGS. 9A and 
9B corresponds to each "arrow" extending from the track 
head entry point (#1/171 to #3/173) to item text 
(#1/236 to #e/238) in (d) of FIG. 11, and has 
information contents indicating "item text number". 

The information contents of the "relationship 
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between the still picture contents to be displayed and 
original track" are shown in "still 10" in FIG. 6B. 
That is, identification information indicating whether 
still pictures displayed in new tracks No* 1 to No. 3 
5 in FIG, 6B use those used in original track 1 

("original" in this case) or are uniquely set by the 
user independently of those used in original track 1 
("newly set" in this case) is given. 

In the present invention, an erasable or non- 
10 erasable area is set in units of tracks. Therefore, an 
"erase inhibition flag" = "1" is set for an audio track 
which is inhibited from being erased* 

As shown in FIGS . 9A and 9B f information unique to 
each track such as "tune name", "singer name", "player 
15 name", or the like can be recorded in each of track 

head entry points (C__EPI) 171 to 173 shown in (h) of 
FIG. 1 or (d) of FIG. 11. As a location for recording 
text information with a relatively small data size such 
as "tune name", "singer name", "player name", or the 
2 0 like, recording areas named primary text information 

(51 to 53) are present in track head entry points 
(C_EPI) 171 to 173. 

By contrast, information which is unique to each 
track but cannot be recorded in primary text 
25 information (51 to 53 in (e) of FIG. 11) due to its 

huge data size can be recorded in item text (IT_TXT 
specified by item text search pointer number 
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IT_TXT_SRPN) 236 to 23 8 shown in (f) of FIG- 11. Track 
head entry points (C_EPI) 171 to 173 shown in (d) of 
FIG . 11 may record only pointer information having 
information (a table of IT__TXT__SRPN) indicating the 
5 order of item texts (ITJTXT). 

FIG. 12 shows the setup state of still pictures in 
program information (#1/311 to #4/314). 

The portion (a) of FIG. 12 shows still pictures 
corresponding to original audio tracks ((b) of FIG. 12 ), 
10 and (c) of FIG. 12 shows an original program chain. 

Each original audio track corresponds to program 
information . 

The program information (#1/311 to #4/314 in (d) 
of FIG. 12) records information (REP_PICTI 41 to 44) 

15 that designates a representative picture indicating the 

corresponding audio track contents,, and still picture 
VOB entries ASV0B_ENT #1/281 to ASVOB_ENT #p+l/29 6 
(or ASVOB_ENTN) in (g) of FIG. 12 can be directly 
designated from that information. The portion (g) of 

20 FIG. 12 is an independent file, i.e., still picture VOB 

group information (ASVUI 273, 274; or ASVUN). 

Original cell information (CI #1/301 to CI #5/305 
in (d) of FIG. 12) does not have any track head entry 
point information (as well as still pictures to be 

25 displayed at the beginning of playback of an audio 

track) and consists of only a still picture entry point 
(C_EPI 31 to 39 in (f) of FIG. 12). 



* 
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Management information includes representative 
image designation information (REP_PICTI 41 to 4 4 in 
(d) of FIG. 12) , which designates a representative 
picture in units of tracks, and serves as designation 
5 information of a representative picture indicating the 

track contents. 

The designation information of a representative 
picture is recorded independently of designation 
information of still pictures to be displayed upon 

10 playing back an audio track. For this reason, an 

arbitrary still picture in a VOB group at a location 
other than a video object (VOB) group that records 
still pictures to be displayed upon playing back an 
audio track can be set as a representative picture, 

15 thus improving the degree of freedom. 

FIG. 13 shows still another embodiment or an 
embodiment that is compos sible with the above 
embodiment. The portion (d) of FIG. 13 shows the setup 
state of text information in program information (PGI 

20 #1/311 to PGI #4/314). "Text information unique to the 

corresponding audio track" in FIGS. 9A and 9B is 
recorded in primary text information (56 to 59; or 
PRMJTXTI) in program information (PGI #1/311 to PGI 
#4/214), as shown in (d) of FIG. 13. 

25 Also, "additional comment text information" in 

FIGS. 9A and 9B corresponds to each "arrow" extending 
toward item text (#1/236 to #e/238; or IT_TXT 239) in 



(e) of FIG. 13, and records information ( IT_TXT__SRPN ) 
indicating "item text to be designated". 

In the above description, information unique to an 
audio track shown in FIGS* 9A and 9B is recorded and 
managed in: 

program information in case of an original track; 

or 

a track head entry point in case of a play list. 

However, the present invention is not limited to 
such specific method, and the scope of the present 
invention includes a case wherein the locations of 
recording/managing unique information that pertains to 
an audio track may be reversed, or two different kinds 
of information may be recorded and managed in an 
identical location . 

That is, the scope of the present invention 
includes a case wherein program information is present 
in a user-defined PGC even for a play list, and the 
program information in the user-defined PGC records 
information unique to an audio track shown in FIGS. 9A 
and 9B. 

FIG. 14 shows the structure in the information 
recording/playback apparatus in the present invention. 

Disc drive 409 records /plays back information with 
respect to information storage medium 100. Various 
kinds of object information input from various input 
means 440 to 442, 412, and 413 are encoded by encoder 
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unit 401, and are then recorded on information storage 
medium 10 0 via disc drive 4 09. At the same time, main 
MPU 4 04 generates management information that pertains 
to each object information, and records the generated 
information on information storage medium 100 via disc 
drive 409. 

The input means include microphone 441, A/V input 
unit 412, key input unit 442, TV tuner 413, and digital 
camera 440. Also, the input means includes set top box 
(STB) 4 03 that receives a broadcast signal. Encoder 
unit 401 has analog-to-digital (A/D) converter 414 that 
receives an A/V input, and selector 415 for arbitrarily 
selecting one of the output signal (video signal) 
from A/D converter 414 and output video signal 423 from 
STB 403. 

Video encoder 416 encodes a video signal output 
from selector 415 by, e.g., MPEG to achieve compression 
coding, and supplies the encoded signal to formatter 
419. Audio encoder 417 executes a processes such as 
MPEG, PCM, or the like of an audio signal from A/D 
converter 414, and supplies the processed signal to 
formatter 419. Information from key input unit 442 is 
input to real-time text (RT_TEXT) encoder 418 and is 
then input to formatter 419 as text data. Buffer 
memory 420 is connected to formatter 419 and is used 
for time adjustment upon converting input data into a 
predetermined format. 
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The output (information converted into a 
predetermined format) from formatter 419 is input to 
digital processor (D-PRO) 410, and is then recorded on 
information storage medium 100 via disc drive 4 09 in 
5 correspondence with management information. Temporary 

storage 411 is connected to D-PRO 410 and serves as a 
buffer for data processes. D-PRO 410 appends error 
correction codes, modulates data, and so forth. 

Management information is generated by main MPU 

10 404. Also, management information read from 

information storage medium 10 0 is also interpreted by 
main MPU 4 04. Main MPU 404 includes an audio related 
data generation controller, audio related data playback 
controller, audio related data partial erase controller, 

15 and work RAM. Display 408 is connected to main MPU 404, 

and key input unit 407 is also connected to control 
this apparatus . 

Upon playing back information on information 
storage medium 100, a signal obtained by reading and 

20 photoelectrically converting recorded information by, 

e.g., optical information reading means of disc drive 
409 is input to D-PRO 410. Playback information is 
input to demultiplexer 425 of decoder unit 402 and is 
demultiplexed into video information, audio information, 

25 and text information. 

The video information is input to and decoded by 
video decoder 428. The audio information is input to 
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and decoded by audio decoder 430. Furthermore, the 
text information is input to and decoded by text 
decoder 429. 

The output video signal from video decoder 42 8 and 
5 output text from text decoder 429 are input to video 

processor (V-PRO) 438. The video signal output from 
video processor 438 is converted into an analog signal 
by digital-to-analog (D/A) converter 436 via video (V) 
mixing unit 405, and the analog signal is supplied to 
10 television display 437. 

Video mixing unit 405 can also composite a video 
signal from STB 403 with text. Frame memory 406 is 
connected to video mixing unit 405. The output from 
video mixing unit 405 can be supplied to personal 
15 computer 435 via interface 434. 

Audio decoder 430 mentioned above decodes an audio 
signal, and the decoded output can be derived as a 
digital output via interface 431. The decoded output 
is supplied to loudspeaker 433 via D/A converter 432. 
20 D/A converter 4 32 can receive an audio signal from 

STB 4 03. 

System clock 450 generates clocks to synchronize 
all units such as STB 421, decoder unit 4 02, encoder 
unit 401, main MPU 404, and the like. System clock 451 
25 generates reference clocks used to synchronize playback 

information and decoder unit 402 upon playing back a 
disc. 
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Upon playback , management information recorded in 
information storage medium 100 is temporarily recorded 
in the work RAM in main MPU 404 via disc drive 409. 
Using the management information temporarily recorded 
5 in the work RAM, object information to be played back, 

which is recorded on information storage medium 100 is 
read, and is then decoded by decoder unit 402. After 
that, the decoded output is output to loudspeaker 433, 
television display 437, or display 408. 

10 A method of creating a play list that pertains to 

audio related information as well as a user interface 
and the detailed structure of management data generated 
consequently will be explained below. 

As a characteristic feature of the embodiment of 

15 the present invention, when the user creates a play 

list, both a list of original track 1, and play lists 
#1 and #2 to be created by the user (FIGS. 6A and 6B) 
are displayed on display 408 shown in FIG. 14 to 
improve user 1 s convenience . 

20 The method of recording audio related information 

on information storage medium 100 will be explained 
below using FIGS. 15 and 16. 

Most of processes in the present invention read 
information in management information recording area 

25 13 0 recorded on information storage medium 100, and 

temporarily record the read information in the work RAM 
in main MPU 404 (step SI in FIG. 15). After a series 
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of processes, a process for recording management 
information, which is recorded in the work RAM and the 
contents of which have been partially changed on 
information storage medium 100 is executed (step S12). 
5 In the embodiment of the present invention shown 

in FIGS. 15 and 16, audio information is additionally 
recorded after file AR_AUDIO.ARO 221 initially (step 
S2), and after that, a process for rewriting management 
information in the work RAM is done. Upon changing the 

10 management information contents, management information 

that pertains to audio information is added/changed in 
audio object information. 

That is, in step S3 main MPU 404 generates program 
information, original cell information, and audio 

15 object information in correspondence with the audio 

track recorded in step S2, and additionally records 
them in the work RAM. 

Furthermore, in step S4 the relative address of 
each audio object unit of the recorded audio track in 

20 AR_AUDIO.ARO 221 is checked, and is additionally 

recorded in audio object information in the work RAM. 

The control asks the user if still pictures to be 
displayed simultaneously with audio information are set 
(step S5). If still pictures are not to be displayed 

25 simultaneously with audio (NO in step S5), the process 

jumps to step Sll in FIG. 16. When still pictures to 
be displayed simultaneously with audio are set (YES in 
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step S5), one of the following methods is selected, 

(1) If new still pictures are to be recorded 
simultaneously with recording of audio information (YES 
in step S6 ) , 

5 new still pictures to be recorded are recorded in 

file AR_S TILL .ARO 213. Management information of still 
pictures, i.e., still picture VOB group information, is 
generated in correspondence with the recorded pictures. 
That is, still picture information is recorded in 

10 AR_still picture object recording area 132 

(additionally recorded after the end of file 
AR_S TILL .ARO 213), and still picture VOB group 
information is generated in correspondence with that 
picture information and is additionally recorded in the 

15 work RAM (step 37 in FIG. 16). 

(2) If still pictures already recorded on 
information recording medium 100 are used (NO in 
step S6 ) , 

the user is asked or promoted to select still 
2 0 pictures to be displayed simultaneously with audio 

information (step S8 in FIG. 16). That is, the user 
selects still pictures to be displayed simultaneously 
with the corresponding audio track from still picture 
VOB groups already recorded in information storage 
2 5 medium 100. 

The information contents of "information for 
designating the display timing of a still picture" to 
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be set in a still picture entry point are automatically 
set in main MPU 4 04 of the information recording/ 
playback apparatus shown in FIG. 14 by computing: 

[display time per still picture] = [playback time 
5 of corresponding audio track] -r [the number of still 

pictures to be displayed in corresponding audio track] 
(the computation result value is recorded in 
"information for designating the display timing of a 
still picture" in the still picture entry point (31 to 

10 39) temporarily recorded in the work RAM) (step S9). 

The user is asked or promoted to input 
"information for designating the saving location of a 
representative picture that represents the contents of 
the corresponding audio track" , "text information 

15 unique to the corresponding audio track", "display mode 

of still pictures in the corresponding audio track" , 
and "erase inhibition flag", which are recorded in 
program information (311 to 314) and are set in units 
of audio tracks (steps S10 and Sll). 

20 More specifically, in step S10, the user sets a 

"representative image for the corresponding audio 
track" as a "display mode of still pictures", and that 
information is recorded in program information (311 to 
314) temporarily stored in the work RAM. 

2 5 In step Sll, the user sets "primary text 

information" and "item text information" using key 
input unit 407, and that information is recorded in 
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primary text information (56 to 59) and item text (236 
to 238) temporarily recorded in the work RAM. Also, 
pointer information to item text is recorded. 

When information unique to each audio track is 
5 recorded in program information (311 to 314) (step S10), 

a specific time range (e.g., 5 sec) after the playback 
start time of the corresponding audio track (not shown) 
is automatically recorded in "information for 
designating the display range of representative audio 

10 indicating the contents of the corresponding audio 

track" shown in FIGS. 9A and 9B. If this time range is 
to be changed, the user can re-set the display range of 
representative audio in an edit process. 

With a series of processes mentioned above, 

15 management information that pertains to audio related 

information is completed, and is recorded on 
information storage medium 100 via disc drive 4 09 
(step S12) . 

A method of partially erasing the contents of an 
2 0 original track in the present invention will be 

explained below. As shown by (e) and (f) of FIG. 7, 
when a central portion of an original track with a 
track name (tune name) "Another Chance" is partially 
erased, an audio object is broken up (or divided) into 
25 two objects like AOB #4 and AOB #5. Audio object 

information, original cell information #4/304, and 
original cell information #5/305 are broken up (or 



> 
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divided) into two pieces of information each 
accordingly, A processing method in the information 
recording/playback apparatus at that time will be 
described below using FIGS. 17 and 18. 
5 In step S21 of FIG. 17, disc drive 409 (FIG. 14) 

reads information (RTR_AMG) of management information 
recording area 130 in disc 100, and temporarily records 
the read information in the work RAM of main MPU 404. 
In step S22, the user designates a partial erase 

10 range in an original track (using time information). 

In step S23, audio object information that 
includes the original track designated by the user is 
broken up into two audio objects before and after the 
partial erase range. As the former half (before the 

15 partial erase range) audio object, existing audio 

object information is used, and an unnecessary audio 
object unit entry is deleted (by main MPU 404). 
Likewise, as the latter half (after the partial erase 
range) audio object, new audio object information is 

2 0 generated, corresponding information is copied from the 

source audio object entry, and is recorded in the work 
RAM. 

In step S24, the partial erase range in file 
AR_AUDIO.ARO 221 that records audio objects is erased. 
25 In short, when a user designates a partial erase 

range using time information (step S22), contents of 
the management information are changed accordingly 
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(step S23), and a corresponding portion in AR_AUDI0.AR0 
221 is erased (step S24). 

In step S25 of FIG. 18, main MPU 404 checks, based 
on display mode information that pertains to still 
5 picture information recorded in program information 

associated with the original track, if still pictures 
displayed within the partial erase range designated by 
the user are to be displayed after partial erase. 

If the corresponding still pictures are to be 

10 displayed after partial erase (YES in step S26), the 

display time of the audio track after partial erase is 
divided by the number of still pictures to compute the 
display time per still picture, thus updating the 
contents of "information for designating the display 

15 timing of a still picture" in still picture entry 

points 31 to 3 9 temporarily recorded in the work RAM 
(step S27) . 

Conversely, if the corresponding still pictures 
are not displayed after partial erase (NO in step S26), 

2 0 information of each still picture entry point included 

in the partial erase range designated by the user is 
detected from still picture entry points 31 to 3 9 
before partial erase, which are recorded in original 
cell information (301 to 305) (step S28). 

25 In step S29, the management information 

temporarily recorded in the work RAM is written back on 
management information recording area 130 in the 
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information storage medium via disc drive 409. 

How to handle still pictures displayed within the 
partial erase range is important. As a method of 
handling still pictures after partial erase, the 
5 present invention selects one of the following two 

methods : 

(A) all still pictures before partial erase are 
displayed upon playing back audio information after 
partial erase irrespective of the partial erase range 

10 (step S27) ; and 

In this case, "information for designating the 
display timing of a still picture" in FIGS . 9A and 9B 
is re-computed and automatically rewritten. 

(B) still pictures displayed within only the 
15 partial erase range are not displayed upon playback 

after partial erase (step S28). 

At this time, a first characteristic feature in 
the edit method of the present invention lies in that 
discrimination information indicating if still pictures 

20 displayed within only the partial erase range are 

allowed to be displayed is recorded in advance in 
management information, and one of (A) and (B) is 
selected based on that information (step S26). 

A second characteristic feature of the present 

25 invention lies in that the user can recognize the 

discrimination information. If the user can recognize 
that information, he or she can understand the selected 
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one of (A) and (B) , thus avoiding user's confusion. As 
the discrimination information that the user can 
recognize, the present invention uses display mode 7 
shown in FIG. 6A. 
5 That is, only when display mode 7 of original 

track 1 indicates "Slideshow + Sequential" ("display 
mode of still pictures in the corresponding audio 
track" in the program information at that time also 
records the same information), (B) is selected; when 

10 information other than the above information is 

recorded, (A) is selected. 

Prior to the description of the creation method of 
play list contents, a preparation method of display 
windows shown in FIGS. 6A and 6B as a characteristic 

15 feature of the present invention will be explained 

below using FIG. 19. 

Initially, information in management information 
recording area 130 recorded on information storage 
medium 100 is read, and is temporarily recorded in the 

20 work RAM in main MPU 404 (step S31). 

The management contents which pertain to original 
track 1 which indicates the playback sequence in the 
order data were initially recorded on information 
storage medium 100 is recorded in original program 

25 chain information 144 or 320, and information that 

pertains to the original track is described in program 
information #1/311 to #4/314 ((g) and (h) of FIG. 7), 
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as will be described later. 

As described above, information that pertains to a 
track of a play list is recorded in track head entry- 
points 171 to 173 in user-defined PGC information table 
5 145. Using such information, main MPU 404 generates a 

list window that pertains to original track 1 (step 
S32), individually generates list windows of tracks 
that pertain to play lists #1 and #2 (step S33), 
composites these windows (step S34), and displays the 

10 composited window (step S35). 

A creation processing method of the play list 
creation display window shown in FIGS. 6A and 6B will 
be explained below with reference to FIG. 19. 

Disc drive 40 9 reads information of management 

15 information recording area 13 0 in the disc, and 

temporarily stores the read information in the work RAM 
in main MPU 404 (step S31). Main MPU 404 interprets 
information that pertains to an original track recorded 
on disc 100 on the basis of the information contents of 

20 the temporarily stored program information (311 to 314) 

to generate display window contents that pertain to 
original track 1 (step 332). Main MPU 404 then 
extracts information associated with tracks in units of 
play lists using information of track head entry points 

25 171 to 173 in cell information 164 to cell information 

169 that form temporarily stored user-defined PGC 
information table 145, thus generating display window 
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contents which pertain to play list 2 (step S33). 

The display windows generated in steps S32 and S33 
are composited, and the composite window is transferred 
to the V mixing unit (step S34). The display window 
5 generated in main MPU 404 is displayed on display 408 

via D/A converter 436. 

A method of creating a play list in the present 
invention will be described in more detail below with 
reference to FIGS. 20 and 21. 

10 Information of management information recording 

area 130 is read, and is temporarily recorded in the 
work RAM in main MPU 404 (step S41 in FIG. 20). 

The edit window (windows of original track 1 and 
play lists) is presented to the user by the method 

15 shown in FIG. 19 (step S42), and the user creates a 

play list (step S43). In this case, the user inputs 
the relationship between a new track to be created and 
the original track while observing the window. 

A display mode is automatically set to match that 

20 designated by the source original track to be played 

back first, but the user can change it later while 
observing the window. At the same time, the user 
inputs unique information that pertains to the new 
track created on the play list (step S44). 

25 That is, in this step, the user inputs display 

mode 8 (FIG. 6B) that pertains to the new track to be 
created, a representative picture, and a still picture 
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setup condition (in case of "original" , the same still 
pictures as those set in the original track are 
displayed; in case of "newly set", the user designates 
new still pictures), while observing the display window. 
5 Main MPU 404 in the information recording/playback 

apparatus shown in FIG. 14 generates new cell 
information and records information in track head entry 
points in the cell information on the basis of the 
input information (step S45). 

10 That is, in this step, new cell information 164 

to cell information 169 are additionally set in 
corresponding user-defined PGC information 156 and 
user-defined PGC information 157, and track head entry 
points 171 to 173 are additionally recorded in cell 

15 information corresponding to a cell which is located at 

the head position in the new track set by the user in 
the work RAM. 

The display mode designated by the user, 
designation information of a representative picture, 

2 0 and the display range of representative audio are 

additionally recorded in track head entry points 171 to 
173 (step S46) . 

As the display range of representative audio in 
step S46, a specific time range (e.g., 5 sec) after the 

25 playback start time of the newly created track is 

automatically recorded. When the user wants to change 
this time range, he or she can re-set the display range 
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of representative audio in an edit process. 

If the user re-sets still pictures to be displayed 
simultaneously with playback with respect to the newly 
created track (YES in step S47 of FIG- 21), the 
5 contents of still 10 in FIG . 6B are changed from 

"Original" to "Newly Set" in correspondence with the 
user's key-in result, and the user selects still 
pictures (step S48). That is, the user selects still 
pictures to be displayed simultaneously with display of 

10 the newly created track from the existing still picture 

VOB group information (273 to 279). 

If the user does not newly designate still 
pictures to be displayed (NO in step S47), it is 
determined whether all still pictures in the original 

15 track are to be displayed (step S49). When all still 

pictures of the original track are to be displayed (YES 
in step S49), the process goes to following (A). If 
all still pictures of the original track are not to be 
displayed (NO in step S49), the process goes to 

2 0 following (B) . 

As the method of setting still pictures upon 
creating a play list, the embodiment of the present 
invention selects one of the following two methods. 

(A) All still pictures of the original track of 

25 interest are displayed upon playing back a new track in 

the play list irrespective of the designated ranges in 
original tracks by the user (step S51). 
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For example, when a new track is created from 
three original tracks, all still pictures displayed 
upon playing back the three original tracks are 
displayed upon playing back the new track. 
5 (B) Only still pictures displayed upon playing 

back the designated ranges in original tracks quoted to 
create a new play list are displayed upon playing back 
a new track (step S50). 

At this time, a first characteristic feature in an 

10 edit method of the present invention lies in that 

discrimination information indicating the method to be 
selected is recorded in advance in management 
information, and one of (A) and (B) is selected based 
on that information (step S49). 

15 A second characteristic feature of the present 

invention lies in that the user can recognize the 
discrimination information. If the user can recognize 
that information, he or she can understand the selected 
one of (A) and (B), thus avoiding user's confusion. 

2 0 As the discrimination information that the user 

can recognize, the present invention uses display mode 
7 shown in FIG. 6A. More specifically, only when 
display mode 7 of original track 1 (corresponding to 
"Automatic" of the original track in new track No. 1, 

25 "First Love" of the original track in new track No. 3 

in the example in FIGS. 6A and 6B) indicates "Slideshow 
+ Sequential" ("display mode of still pictures in the 
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corresponding audio track" in the program information 
at that time also records the same information) , (B) is 
selected; when information other than the above 
information is recorded, (A) is selected. 
5 After step S50 or S51, main MPU 404 divides the 

display time of the corresponding new track by the 
number of still pictures to be displayed in the new 
track in order to compute the display time per still 
picture (step S52)* The computed value is recorded in 
10 "information for designating the display timing of 

still picture" in still picture entry points 21 to 26 
and track head entry points 171 to 173. These entry 
points are temporarily recorded in the work RAM of 
MPU 404. 

15 Then, the management information temporarily 

recorded in the work RAM is rewritten in management 
information recording area 130 of disc 100 via drive 
409 (step S53) . 

Although not shown in FIGS. 2 0 and 21, immediately 

2 0 after "information for designating the display timing 

of a still picture" is set (step S52), "text 
information unique to the corresponding audio track" 
and "additional comment text information" in original 
track 1 from which audio information to be played back 

2 5 first upon playing back the new track is quoted are 

automatically transferred as those to be recorded in 
track head entry points corresponding to the newly 
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created track, and text information is rewritten as 
needed by user's key-in operation (step S53). 

FIGS . 22 and 23 show the processing sequence when 
video information is used as still picture information 
5 to be displayed simultaneously with audio information. 

Information in management information recording 
area 130 is read, and is temporarily recorded in the 
work RAM in main MPU 404 (step S61 in FIG. 22). The 
windows of original track 1 and play lists are 

10 displayed by the method shown in FIG. 19 (step S62). 

The user designates a track for which still picture 
information extracted from video information is to be 
simultaneously displayed while observing the displayed 
window (step S63). 

15 The user then designates a scene to be extracted 

as still picture information from video information 
(that in file VR_MOVIE . VRO 212 that records movie 
object information) while observing the displayed 
window (step S64). 

20 Scene information designated in this step is 

extracted as a still picture, which is recorded as a 
part of still picture object file AR_STILL. ARO 213 on 
information storage medium 100 from disc drive 409 via 
V-PRO 438 and video mixing unit 405 (step S65). 

25 In correspondence with the still picture extracted 

and recorded on the disc in this step, new still 
picture VOB group information #g/279 and still picture 
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VOB entries #1/298 and #2/299 are recorded in the work 
RAM in main MPU 404 (step S66 in FIG- 23)- 

Still picture information designated by track head 
entry point #2/172 and still picture entry point 24 in 
5 cell information #4/167 corresponding to the track 

designated by the user in step S63 is changed to still 
picture VOB entries #1/298 and #2/299 generated in step 
S66 (step S67 ) * 

The display time of the corresponding track is 

10 divided by the number of still pictures to be displayed 

within that track to compute a display time per still 
picture , and the computation result is recorded in 
"information for designating the display timing of a 
still picture" in still picture entry point 24 and 

15 track head entry point 172, which are temporarily 

stored in the work RAM (step S68). 

The management information temporarily recorded in 
the work RAM in main MPU 4 04 is rewritten in management 
information recording area 130 via disc drive 409 

20 (step S69) . 

A characteristic feature of the present invention 
lies in that an arbitrary video screen image in file 
VR__MOVIE.VRO 212 that records video information (movie 
object) , as shown in (h) of FIG. 10, can be used as a 

25 still picture to be displayed simultaneously with 

playback of audio information. An example of the 
method will be described below using FIGS. 10, 22, 



- 68 - 

and 23. 

A list of audio tracks recorded on information 
storage medium 100 is presented to the user, as shown 
in FIGS. 6A and 6B, and the user designates a track for 
5 which still pictures are to be set (step S63). The 

user then designates a desired screen image (scene) 
while displaying video information recorded in file 
VR_MOVIE.VRO 212 (step S64). Since the screen image 
(scene) designated by the user has already been decoded 

10 by decoder unit 4 02 in FIG. 14, that picture informa- 

tion is directly recorded as a still picture 
(I-picture) in file AR_STILL . ARO 213 (step S65), and 
management information that pertains to the still 
picture is generated (step S66). Display related 

15 information between the generated still picture and 

audio information is recorded in track head entry 
pointer #2/172 or still picture entry point 24. 

A normal user sets a desired screen image (scene) 
but does not often set the display timing. Therefore, 

20 a characteristic feature of the information 

recording/playback apparatus in the present invention 
lies in that main MPU 4 04 in the information 
recording/playback apparatus shown in FIG. 14 
automatically sets the value of "information for 

25 designating the display timing of a still picture" 

(FIGS. 9A and 9B) (FIG. 22, step S68 in FIG. 23). 

More specifically, main MPU 404 automatically 



computes : 

[display time per still picture] = [playback time 
of corresponding audio track] -r- [the number of still 
pictures to be displayed in corresponding audio track] 

In original PGC 320 that manages information of an 
original track, information that pertains to an 
original track shown in FIGS . 9A and 9B is recorded in 
program information (#1/311 to #4/314). 

A method of playing back audio related information 
recorded on information storage medium 10 0 by the 
aforementioned method will be explained below using 
FIGS . 24 and 25. 

Information in management information recording 
area 130 is read, and is temporarily recorded in the 
work RAM in main MPU 4 04 (step S71 in FIG. 24). 
Information that pertains to an original track recorded 
on the information storage medium is interpreted on the 
basis of the program information (311 to 314) 
temporarily recorded in the work RAM to generate 
display window contents that pertain to original track 
1 (step S72 ) . 

Information associated with tracks in units of 
play lists is extracted from information of track head 
entry points 171 to 173 in cell information 164 to cell 
information 169 that form user-defined PGC information 
table 145 temporarily recorded in the work RAM, thus 
generating display window contents which pertain to 



play list 2 (step S73). 

The display windows generated in steps S72 and S73 
in FIG. 24 are composited, and the composite window is 
transferred to the V mixing unit (step S74). The 
display window generated in main MPU 404 is displayed 
on display 408 via D/A converter 436 (step S75). 

The window shown in FIGS. 6A and 6B is displayed 
by the method shown in FIG. 19, and the user selects a 
track to be played back (step S76). 

In step S77 of FIG. 25, the playback start and end 
times of representative audio are read from the 
"information for designating the display range of 
representative audio indicating the playback contents 
of the corresponding audio track" in track head entry 
points 171 to 173 or program information 311 to program 
information 314 • 

On the other hand, in step S7 8 the playback start 
and end addresses in AR_AUDIO.ARO 221 that records 
representative audio information are computed using 
information in audio object unit entries 241 to 248 in 
audio object information 196 to audio object 
information 197. 

In step S79, information in the predetermined 
address range is played back and is output as sound. 
The user listens to that representative audio to check 
if it is an audio track he or she wants to listen to. 
After confirmation, the user designates the playback 



range and presses the playback button (step S80). 

That range in original PGC information 144 or 
user-defined PGC information 155 to user-defined PGC 
information 157 , which corresponds to the track range 
designated by the user is discriminated from the 
management information temporarily stored in the work 
RAM (step S81) . 

Object information is played back and displayed in 
units of tracks from the disc in accordance with the 
order program information 311 to program information 
311 or cell information 164 to cell information 169 are 
arranged in original PGC information 144 or user- 
defined PGC information 155 to user-defined PGC 
information 157 (step S82). 

According to the method of FIGS. 24 and 25 , a user 
can listen to representative audio before he or she 
selects a tune he or she wants to listen to, thus 
confirming in advance if it is the tune he or she 
really wants to listen to. That is, when the user 
designates an audio track to be confirmed and presses a 
playback button of representative audio, main MPU 404 
computes the addresses to be accessed in AR_AUDIO.ARO 
221 using audio object unit entries AOBU_ENT #1/241 to 
AOBU_ENT #h/248 (step S78) from "information for 
designating the display range of representative audio 
indicating the playback contents of the corresponding 
audio track" (step S77 in FIG . 25), and plays back and 
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displays audio information from the information storage 
medium (step S79). 

As described above, in the present invention, main 
MPU 404 in the information recording/playback apparatus 
5 discriminates that range in each object file, which 

corresponds to the range designated by the user (step 
S77), and plays back and displays based on the 
discrimination result. At this time, a characteristic 
feature of the present invention lies in that object 

10 information is played back in units of tracks from 

information storage medium in accordance with the order 
program information 311 to program information 311 or 
cell information 164 to cell information 169 are 
arranged in original PGC information 144 or user- 

15 defined PGC information 155 to user-defined PGC 

information 157. 

The gist implemented by the present invention is 
summarized as follows. 

Break information of audio tracks is recorded in 

20 PGCI. The track break information is provided with 

text information and a representative picture of the 
track. Program information is recorded in units of 
original tracks. 

A track head entry point in cell information 

25 indicates a break of tracks. Playback is made in units 

of tracks in accordance with the order a plurality of 
pieces of program information/cell information 
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described in the PGCI are arranged. The display 
timings are automatically computed in correspondence 
with still pictures designated by the user. A display 
method of simultaneously displaying an original list 
5 and play list on the screen (also applies to RTR) ♦ 

Furthermore, an edit method of creating tracks on 
the play list using original track information. Still 
pictures of an original track to be pasted on a track 
of the play list are determined. Still pictures to be 

10 displayed after partial erase are determined. An edit 

method of extracting an arbitrary scene of a movie 
object as a still picture and displaying it 
simultaneously with an audio object, and so forth. 

FIG. 2 6 shows an example of the data structure of 

15 a part (UD_PGCIT 145 shown in (e) of FIG. 1 ) of a real 

time recording audio manager { RTR_AMG 130 shown in (d) 
of FIG. 1) . 

As shown in FIG. 26, real time recording (RTR) 
audio manager RTR_AMG (130 in (d) of FIG. 1) includes 

20 RTR audio manager information RTR_AMGI, audio file 

information table AUDFIT, audio still video file 
information table ASVFIT, original program chain (PGC) 
information ORG_PGCI (144 in (e) of FIG. 1), user- 
defined PGC information table UDJPGCIT (145 in (e) of 

25 FIG. 1), text data manager TXTDT_MG (146 in (e) of 

FIG. 1), and manufacture's information table MNFIT (147 
in (e) of FIG. 1) . 
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User-defined PGC information table UD_PGCIT 
includes user-defined PGC information table information 
UD_PGCITI (150 in (f) of FIG. 1), one or more user- 
defined PGC information search pointers UD_PGCI_SRP #1 
5 to #n (151, 152 in (f) of FIG. 1), and one or more 

pieces of user-defined PGC information UD__PGCI #1 to #n 
(156, 157 in (f) of FIG . 1). 

There are two type of PGC information. One is an 
original PGC managed by ORG__ PGCI, and the other is one 
10 or more user-defined PGCs managed by UD_PGCIT. 

The original PGC describes a playback sequence of 
originally recorded contents (cells). The user-defined 
PGC describes a playback sequence which can be obtained 
by freely arranging (or modifying) the order of 
15 playback of cells (originally recorded contents) by a 

user. 

FIG. 27 shows an example of the data structure of 
a program chain information (PGC Information) contained 
in the real time recording audio manager (RTR_AMG) 

20 shown in FIG. 26. 

As shown in FIG. 27, PGC information (ORG__PGCI or 
one of UD_PGCIs) #i includes PGC general information 
PGC_GI (160 in (g) of FIG. 1; cf. FIG. 48), one or more 
pieces of program information PGI #1 to #n (311 to 313 

25 in (d) of FIG. 12; cf. FIG. 49), one or more cell 

information search pointers CI_SRP #1 to #n (161, 162 
in (g) of FIG. 1), and one or more pieces of cell 
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information CI #1 to #n (164-169 in (g) of FIG. 1, or 
164 to 169 in (d) of FIG. 7). 

Each cell information CI includes cell general 
information C_GI and one or more pieces of cell entry 
5 point information C_EPI #1 to #n (171-173 in (h) of 

FIG. 1, or the entry points in (c) of FIG. 7). 

Although not shown, cell general information C_GI 
contains information items of: 

(1) C_TY describing a type of the cell (e.g., 
10 C__TY1 = "010b" is described for Audio Cell); 

(2) PB_INF describing playback information (e.g., 
on/off information of dynamic range control, and a 
dynamic range control value) of the cell; 

(3) AOBI_SRPN describing the AOBI search pointer 
15 number of the corresponding AOB of the cell; 

(4) ASVUI_SRPN describing the ASVUI search 
pointer number of the corresponding ASVU of the cell; 

(5) ASV_DMD describing the display timing mode 
and display order mode of the ASVU corresponding to the 

20 cell; 

(6) C_EPI__Ns describing the number of cell entry 
points C_EPIs; 

(7) C_A__S_PTM describing the presentation start 
time of the cell; and 

2 5 (8) C_A_E_PTM describing the presentation end 

time of the cell. 

FIGS. 28A and 2 8B respectively show examples of 



- 76 - 

entry points in an original PGC and in a user-defined 
PGC. 

FIG. 28A shows an example of entry points in the 
original PGC- In FIG. 28A, there are three programs PG 
5 #1 to PG #3. Each of these programs has only one cell, 

In the cell of PG #1, there are seven entry points in 
total. Three entry points (1) to (3) of PG #1 are for 
an index , and four entry points [1] to [4] thereof are 
for a display list. 

10 Note that the entry points for an index of PG #1 

to PG #3 are indicated by the arrows over the boxes of 
PG #1 to PG #3, and the index number representing these 
entry points are indicated by the numbers enclosed in 
the circles. Thus, the entry points for index include 

15 the value of index number as additional information. 

The arrows under the boxes of PG #1 to PG #3 
indicate the entry points for a display list. When 
there is an audio still video (ASV) to be presented 
with audio data, the track has information about the 

20 number of audio still video unit (ASVU) which includes 

the audio still video to be presented together. Each 
entry point has information about the number of audio 
still video object (ASVOB) in the specified ASVU. 
Specified ASVOB is presented at the timing of the entry 

25 point. 

FIG. 28A illustrates an example of slideshow/ 
sequential mode. In case of slideshow/random or 
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shuffle, each entry point has no information about the 
number of ASVOB, because the number of ASVOB to be 
presented shall be decided at random or by shuffle. 
When the ASV display mode of the track is a browsable 
5 mode, timing information of the entry points for audio 

still video (ASV) is all set to zero. This is because, 
in case of the browsable mode, a user can skip to the 
next or previous audio still video at any timing, and 
predetermined timing information is not necessary. 

10 FIG. 28B shows an example of entry points in the 

user defined PGC. In case of a user defined PGC, the 
PGC contains no program (PG) structure and the PGC 
contains only a cell structure. Therefore, a user 
defined track is not realized by a program (PG) 

15 structure. A new entry point <T> for the user defined 

track is introduced in case of the user defined PGC. 

In FIG. 28B, three cells are illustrated : cell #1, 
cell #2, and cell #3. The illustrated cell #1 and cell 
#2 have the new entry points <T> for the user defined 

20 track. In the example of FIG. 28B, cell #1 corresponds 

to one user defined track, and cell #2 and cell #3 
correspond to another user defined track. The entry 
point for user defined track shall be set to the start 
point of the cell. 

2 5 As for an entry point for the index and an entry 

point for the display list, there are the same as case 
(FIG. 28A) of the original PGC. 
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Incidentally, the numbers (1 to 5) or characters 
(T) with downward arrows correspond to the track head 
entry points in (c) of FIG. 7, and the numbers (1-4) 
with upward arrows correspond to the still picture 
5 entry points in (c) of FIG. 7. 

FIG. 29 shows an example of contents of cell entry 
point information (C_EPI of type Al ; C_EPI# shown in 
FIG. 27 or (h) of FIG. 1). 

As shown in FIG. 29, C_EPI of type Al includes 
10 information items of: 

( 1 ) EP_TY describing a type of the entry point 
(this EPJTY is formed of EP__TY1 and EP_TY2, and the 
type Al is indicated by a combination of the contents 
of EP_TY1 = "01b M and EP_TY2 = "00b".); 
15 (2) EP_PTM describing the presentation time of 

the entry point (all byte of this EP_PTM shall be set 
to "00h"); 

(3) PRM_TXT describing the primary text 
information for the entry point (PRM_TXT is divided 

20 into two sub fields: the first 64 byte field is used to 

describe a primary text in, e.g., ASCII character set, 
the last 64 byte field is used to describe a primary 
text in another character set defined in RTR_AMGI shown 
in FIG. 26 , etc. ) ; 

25 (4) IT__TXT__SRPN describing the search pointer 

information (search pointer number) of item text IT_TXT 
whose text data corresponds to the entry point; and 
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(5) REP_PICTI describing representative picture 
information (see FIG. 50). 

Note that EP_TY of type Al corresponds to EP_TY 
described in the first paragraph of the "information 
5 contents" of the table shown in FIG, 9A. 

Note that EP__PTM of type Al corresponds to EP__PTM 
described in the 8th paragraph of the "information 
contents" of the table shown in FIG. 9A. 

Note that PRM_TXT of type Al corresponds to 
10 PRM_TXTI described in the 5th paragraph of the 

"information contents" of the table shown in FIG. 9A. 

Note that IT_TXT_SRPN of type Al corresponds to 
IT_TXT_SRPN described in the 6th paragraph of the 
"information contents" of the table shown in FIG. 9A. 
15 Also note that REP_PICTI of type Al corresponds to 

REP_PICTI described in the third paragraph of the 
"information contents" of the table shown in FIG. 9A. 

FIG. 30 shows an example of the data structure of 
another part (AUDFIT 14 3 shown in (e) of FIG. 3) of the 
20 real time recording audio manager (RTR_AMG 130 shown in 

(d) of FIG. 3) . 

As shown in FIG. 30, audio file information table 
AUDFIT in RTR_AMG includes audio file information table 
information AUDFITI (180 in (f) of FIG. 3), one or more 
25 pieces of audio stream information AUD_STI #1 to #n 

(181, 182 in (f) of FIG. 3), one or more pieces of down 
mix coefficient information DM COEFI #1 to #m, and 
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audio file information AUDFI (184 in (f) of FIG. 3). 

Although not shown, AUDFITI contains information 
items of: 

( 1 ) AUDFI_Ns describing the number of pieces of 
5 audio file information AUDFI; 

(2) AUD_STI_Ns describing the number of pieces of 
audio stream information AUD_STI; 

(3) DM_COEFI_Ns describing the number of pieces 
of down mix coefficient information DM_COEFI; and 

10 (4) AUDFIT__EA describing the end address of audio 

file information table AUDFIT with the relative block 
number from the first byte of the AUDFIT. 

Note that AUDFI_Ns indicates the number of audio 
files (AR_AUDIO.ARO 221 in FIG. 2). Since the maximum 

15 number of the audio files is "1" , AUDFI_Ns shall take 

the value "0" or "1". Also note that the maximum 
number of AUD_STI_Ns is "64", and the maximum number of 
DM_COEFI_Ns is "16". 

Although not shown, each AUD_STI contains 

2 0 information items of: 

(a) A_ATR describing the audio attribute of the 
AOB which refers to the AUD_STI; and 

(b) TXT__ART describing the text attribute (e.g., 
ASCII, etc.) of real time text data RT_TXTDT included 

25 in the AOB which refers to the AUD__STI. 

The above A_ATR includes information items of: 
(al) Audio coding mode indicating any of Linear 



PCM mode, Packed PCM mode/Lossless-compressed mode, 
etc . ; 

(a2) Q of CH_GR1 describing the quantization word 
length (16-bit, 20-bit, 24-bit, etc.) of the channel 
5 group 1 (CH_GR1) of Linear PCM audio or the source data 

of Packed PCM audio; 

(a3) Q of CH_GR2 describing the quantization word 
length (16-bit, 20-bit, 24-bit, etc.) of the channel 
group 2 (CH_GR2) of Linear PCM audio or the source data 
10 of Packed PCM audio; 

(a4) fs of CH_GR1 describing the sampling 
frequency (48 kHz, 96 kHz, 192 kHz, 44.1 kHz, 88.2 kHz, 
176.4 kHz, etc.) of the channel group 1 (CH_GR1) of 
Linear PCM audio or the source data of Packed PCM 
15 audio; 

(a5) fs of CH_GR2 describing the sampling 
frequency (48 kHz, 96 kHz, 192 kHz, 44.1 kHz, 88.2 kHz, 
176.4 kHz, etc.) of the channel group 2 (CH_GR2) of 
Linear PCM audio or the source data of Packed PCM 
2 0 audio; 

(a6) Multi-channel type describing the type of 
the multi-channel source of Linear PCM audio or the 
source data of Packed PCM audio related to: 

* Channel assignment including the number of 
25 channels, and 

* Down-mix method (for Linear PCM only); and 
(a7) Cannel assignment describing the assignment 
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of each channel of Linear PCM audio data or Packed PCM 
audio data in the AOB. 

The above real time text data (RT_TXTDT) can be 
recorded in, e.g., AR real-time text object recording 
5 area 134 shown in (d) of FIG. 1. 

Although not shown, down mix coefficient 
information DM_COEFI includes down mix coefficient 
table DM_COEFT. Contents of DM_COEFT can be used to 
determine the coefficients to mix down the Linear PCM 
10 audio data from multi-channel to 2-channel. This 

information is used only when audio data is coded as 
Linear PCM. 

Note that the same DM_COEFI may be shared by 
multiple AOBs. 

15 As shown in FIG. 30, audio file information AUDFI 

includes audio file information general information 
AUDFI_GI (190 in (g) of FIG. 3), one or more audio 
object information search pointers AOBI_SRP #1 to #n 
(191, 192 in (g) of FIG. 3). and one or more pieces of 

2 0 audio object information AOBI #1 to #n (196, 197 in (g) 

of FIG. 3) . 

Although not shown, AUDFI__GI contains AOBI_SRP_Ns 
describing the number of A0BI_SRPs. Note here that the 
minimum and maximum numbers of AOBs in the audio file 
25 is "1" and "999", respectively. 

The search pointer AOBI_SRP contains AOBI_SA 
describing the start address of the AOBI with relative 
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block numbers from the first byte of AUDFIT. 

Each audio object information AOBI contains AOB 
general information AOB_GI and AOB unit information 
AOBUI . 

5 Although not shown, AOB_GI (corresponding to 240 

in (h) of FIG. 3) contains information items of: 

(1) AOB_TY describing the type of AOB, and AOB_TY 
includes temporary erase flag TE (TE = "Ob" indicates 
that the AOB is in normal state, and TE = "lb" 
10 indicates that the AOB is in temporarily erased state. 

An AOB in temporarily erased state shall not be 
referred to by a cell in a user-defined PGC, nor be 
presented in a normal playback operation such as a 
track play. ) ; 

15 (2) AOB_CNT describing the contents (AUD_STI 

number AUD_STIN, bit-shift of CHGR2, stereo playback 
mode, DMCOEFI number DMCOEFIN, DMCOEFIN validity 
describing whether DMCOEFIN is valid or not) of the 
AOB; 

2 0 (3) AOB_REC__TM describing the recording time (the 

time when the head of audio data of the AOB was 
recorded) of the AOB; 

(4) AOB_REC_TM_SUB describing the sub-second 
information for AOB_REC__TM; 

25 (5) AOB_A_S_PTM describing the presentation 

starting time of the first audio frame, which is coded 
as presentation time stamp PTS, of the AOB (when no PTS 
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is presented in the AOB , the presentation starting time 
shall be calculated in the manner of MPEG 
specification) ; and 

( 6 ) AOB_A_E_PTM describing the presentation 
5 terminating time of the last audio frame of the AOB, 

As shown in FIG . 30, AOBUI contains audio object 
unit general information AOBU_GI, and one or more AOBU 
entries AOBU_ENT #1 to #n (241 to 248 in (h) of FIG- 3) 
Although not shown, AOBU_GI (also corresponding to 
10 2 40 in (h) of FIG. 3) contains information items of: 

( 1 ) AOBU_PB_TM describing the playback time of 
one AOBU; 

(2) AOBU_SZ describing the size of AOBU (the size 
is specified by the number of data packs in AOBU; 

15 (3) Ii_AOBU_PB__TM describing the playback time of 

the last AOBU of the AOB; 

(4) L_AOBU__SZ describing the size of the last 
AOBU in the AOB; 

(5) AOBU_ENT_Ns describing the number of AOBU 
20 entries in the AOBUI; and 

(6) AOB_SA describing the start address of the 
AOB with relative logical block numbers from the first 
logical block of the AR__AUDIO.ARO file (221 in FIG. 2). 

Note that the playback time of AOBU shall be equal 
25 to or less than 1 second and, therefore, AOBU_PB_TM 

describes the shortage of the AOBU playback time from 
1 second. 
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FIG. 31 shows an example of contents of an audio 
object unit entry (A0BU_ENT) corresponding to AOBUJENT 
#n (n = integer number) shown in FIG. 30, or 
corresponding to 241 to 248 in (h) of FIG. 3. 
5 As shown in FIG. 31, each A0BU_ENT contains 

AOBU__SZ describing the size of the AOBU. This AOBU 
size is specified by the number of data packs in the 
AOBU. When each pack has 204 8 bytes (or 2k bytes), the 
size of AOBU becomes an integer number of 2048 bytes 
10 (or an integer number of 2k bytes). 

FIG. 32 illustrates a concept of AOBU accesses for 
presenting contents (audio frames) of audio object 
units AOBUs . 

In the example of FIG. 32, three AOBs (AOB #1 to 
15 #3) are recorded in the AR_AUDIO.ARO file. The first 

data of each AOB is accessed by specifying the relative 
logical block number inside the file. Therefore, in 
order to access AOB #2, AOB #2 start address is 
described in the data field of AOBI for AOB #2. The 
20 start address for AOB #1 becomes "0" in the relative 

logical block number inside the file. 

AOB #2 in FIG. 32 is formed of a sequence of AOBUs 
starting at AOBU #1. Each AOBU is formed of a sequence 
of packs. A pack is a unit containing the divided 
25 audio and text data for multiplexing. Inside the 

AR_AUDI0.AR0 file, all packs shall be recorded 
contiguously in the sense of the relative block number 
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inside the file. Therefore, in order to access AOBU #i 
in AOB #2, the start address of AOBU #i should be 
obtained. This is computed by adding the AOB #2 start 
address and the relative start address inside AOB #2. 
5 This is the basic mechanism to access AOBU using the 

data structure of AOBI. 

Incidentally, AOBs #1, #2, #3, ... of the 
AR__AUDIO.ARO file shown in FIG. 32 correspond to 196 to 
197 shown in (g) of FIG. 3. AOBUs #1, #2, #3, ... of 

10 the AOBU data sequence shown in FIG. 32 correspond to 

241 to 248 shown in (h) of FIG. 3. Information item 
251 shown in (I) of FIG. 3 corresponds to the audio 
frames in AOBU #i (i = 1, 2, 3, ...). 

Any presentation of AOB and AOBU can be specified 

15 by the presentation start time and presentation end 

time . 

The presentation start time of AOB and AOBU is 
defined using time stamps described in data packs of 
the AOB. For instance, the first audio frame of audio 

20 packs in each AOBU has its presentation time in the PTS 

field of the packet header (not shown). In order to 
decode and present data of AOB or AOBU, the reference 
clock (e.g., STC1 in FIG. 14) inside the decoder (402 
in FIG. 14) is set to the SCR value described in the 

25 first pack (not shown) at which the presentation begins, 

and then the clock is being counted automatically. 
Based on the clock, the presentation of an AOB or AOBU 



is performed. 

FIG. 33 illustrates a concept of AOBU entries 
(AOBU_ENT#) . 

If coding of the audio elementary stream is in a 
variable bit-rate, an AOBU entry having a structure to 
store size information for each AOBU is prepared for 
every AOBU. This is because the number of data packs 
of an AOBU is flexible. 

In the case of a constant bit-rate , an AOBU entry 
is not defined. AOBU entries associated with an AOB 
are describing in ascending order of presentation time 
associated with the AOBU. 

In order to minimize the table size, each AOBU 
entry contains only the number of packs (AOBU_SZ in 
FIG. 33). Using this information (AOBU_SZ it is 

possible to compute which AOBU corresponds to a given 
presentation time. This is because the presentation 
period of each AOBU except for the last AOBU is always 
constant. 

FIG. 34 shows an example of the data structure of 
still another part (ASVFIT 142 shown in (e) of FIG. 4) 
of the real time recording audio manager (RTR_AMG 130 
shown in (d) of FIG. 4). 

As shown in FIG. 34, audio still video file 
information table ASVFIT (142 in (e) of FIG. 4) 
includes audio still video file information table 
information ASVFITI (260 in (f) of FIG. 4), audio still 
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video stream information ASV_STI #1 to #n (261 to 2 62 
in (f) of FIG. 4), and audio still video file 
information ASVFI (2 64 in (f) of FIG. 4). 

Although not shown, ASVFITI contains information 
5 items of: 

(1) ASVFI_Ns describing the number of pieces of 
audio still video file information ASVFI; 

(2) ASV_STI__Ns describing the number of pieces of 
audio still video stream information ASV_STI; and 

10 (3) ASVFIT_EA describing the end address of audio 

still video file information table ASVFIT with the 
relative block number from the first byte of ASVFIT. 

Each audio still video stream information ASV_STI 
contains video attribute V_ATR. This V_ATR describes 
15 the video attribute of one or more audio still video 

units (ASVU or ASVUs) which refer to the ASV_STI . 

Although not shown, V_ATR contains information 
items of : 

(a) Video compression mode indicating MPEG-1, 
2 0 MPEG-2, etc.; 

(b) TV system indicating 525/60 (NTSC), 625/50 
( PAL ) , etc . ; 

(c) Aspect ratio indicating 4:3, 16:9, etc.; 

(d) Video resolution such as 720 X 480, 
25 544 X 480, etc. 

As shown in FIG. 34, audio still video file 
information ASVFI includes ASVFI general information 
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ASVFI_GI (2 70 in (g) of FIG. 4), one or more ASVUI 
search pointers ASVUI_SRP #1 to #n (271 to 272 in (g) 
of FIG. 4), and one or more pieces of ASVU information 
ASVUI #1 to #n (273 to 279 in (g) of FIG. 4). 
5 Although not shown, ASVFI__GI contains ASVUI__SRP_Ns 

describing the number of ASVUI search pointers 
ASVUI_SRPs. Note that the minimum number of ASVUs in 
the audio still video file is "1" and the maximum 
number thereof is "999". 

10 Although not shown, each ASVUI_SRP contains 

ASVUI_SA describing the start address of audio still 
video unit information ASVUI with the relative block 
number from the first byte of the ASVFIT. 

As shown in FIG. 34, ASVU information ASVUI 

15 contains ASVU general information ASVU_GI (280 in (h) 

of FIG. 4) and one or more audio still video object 
entries ASVOB__ENT #1 to #n (281 to 289 in (h) of 
FIG. 4). 

Although not shown, ASVU__GI contains information 
20 items of: 

( 1 ) ASVOB_Ns describing the number of ASVOB in 
the ASVU; 

(2) ASV__STIN describing the ASV_STI number of the 
ASVU (note that more than one ASVUs may share the same 

2 5 ASV_STI); 

(3) FIRST_ASVOB_REC_TM describing the time when 
the first ASVOB in the ASVU was recorded; 
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(4) LAST_ASVOB_REC_TM describing the time when 
the last ASVOB in the ASVU was recorded (note that 
FIRST_ASVOB__REC_TM shall be earlier than 
LAST_ASVOB_REC_TM in the same ASVU); and 
5 (5) ASVU_SA describing the start address of ASVU 

with the relative logical block number from the first 
logical block of the AR_STILL . ARO file (213 in FIG- 2). 

Although not shown, each ASVOB__ENT contains 
information items of : 
10 (a) ASVOB_ENT_TY describing the type of ASVOB 

entry; and 

(b) ASVOB__SZ describing the size of ASVOB in 
logical blocks. 

Note that ASVOB_ENT_TY includes temporary erase 
15 flag TE (where TE = "00b" indicates that the ASVOB is 

in a normal state, and TE = "01b" indicates that the 
ASVOB is in a temporarily erased state). 

FIG. 35 shows an example of contents of an audio 
still video object entry ( AS VOB_ENT ) corresponding to 
20 ASVOB__ENT #n (n = integer number) shown in FIG- 34, or 

to ASVOB_ENT #1 shown in FIG. 4. 

As shown in FIG. 35, ASV0B_ENT contains 
information items of : 

(a) ASVOB_ENT_TY describing the type of ASVOB 
25 entry; and 

(b) ASVOB_SZ describing the size of ASVOB in 
logical blocks. 
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ASVOB_ENTJTY includes temporary erase flag TE. 
Here, TE = "00b" indicates that the ASVOB is in a 
normal state, and TE = "01b" indicates that the ASVOB 
is in a temporarily erased state. 
5 FIG. 36 shows an example of the data structure of 

the audio still video object (ASVOB) . 

As shown in FIG. 36, each ASVU (here ASVU #2) of 
the AR_STILL • ARO file is formed of one or more ASVOBs 
(here ASVOB #1 to #n) . Each ASVOB (here ASVOB #2) 

10 includes a dummy pack, one or more video packs VJPCKs, 

and a logical block for MPEG_ program _end_code. Only 
one the dummy pack shall exist at the start of ASVOB. 
The dummy pack contains a system header and some 
additional information (such as recording information 

15 consisting of ISRC code, copy control information, etc., 

not shown) . The V__PCK complies with MPEG program 
stream and contains only an I-picture of MPEG video 
data followed by a sequence_end_code . Each ASVOB 
shall terminate with the logical block for 

2 0 MPEG_progr am__end_code . 

Incidentally, the ASVUs shown in FIG. 36 
respectively correspond to 273 to 279 in (g) of FIG. 4, 
and the ASVOBs shown in FIG. 36 respectively correspond 
to 281 to 289 in (h) of FIG. 4. 

25 FIG. 37 illustrates a concept of ASVOB accesses. 

The ASVUs and ASVOBs shown in FIG. 3 7 respectively 
correspond to the ASVUs and ASVOBs shown in FIG. 36. 



Audio still video ASV is still picture data to be 
presented together with audio data by a video capable 
player (abbreviated as a VCAP). Audio still video 
object ASVOB is composed of only one still picture 
without a button (a visual item for a user selection), 
and audio still video unit ASVU is the collection of 
one or more (up to "99") ASVOBs. 

In order to access an ASVOB recorded in •ARO file, 
ASVUI is used to obtain address information with 
respect to the file. Each ASVUI is formed of address 
information of ASVOBs included in the ASVU. 

FIG. 37 shows the basic concept of ASV and its 
ASVOB access. In FIG . 37, three ASVUs (ASVU #1 to #3) 
are recorded in the AR_S TILL. ARO file. ASVU #2 
consists of a sequence of ASVOBs starting at ASVOB #1. 
To access ASVOB #i in ASVU #2, the player (VCAP) 
obtains the start address of ASVU #2 in the 
AR_STILL . ARO file and the start address of ASVOB #i in 
ASVU #2. Then, the player sums up the two start 
addresses to get the start address of ASVOB #i in the 
AR_STILL.ARO file. 

FIG. 3 8 shows an example of the data structure of 
yet another part (TXTDT_MG 146 shown in (e) of FIG. 5) 
of the real time recording audio manager ( RTR__AMG 130 
shown in (d) of FIG. 5). 

As shown in FIG. 38, text data manager TXTDT_MG 
includes text data information TXTDTI (231 in (f) of 



FIG. 5), one or more item text search pointers 
IT_TXT_SRP #1 to #n (232 to 233 in (f) of FIG. 5) f and 
one or more item texts ITJFXT (236 to 238 in (f ) of 
FIG. 5). 

Although not shown, TXTDTI contains information 
items of : 

(1) CHRS describing the character set code (ASCII 
code, etc.) to be used in the TXTDT_MG; 

(2) IT_TXT_SRP_Ns describing the number of 
I T_TXT_ SRP s ; and 

(3) TXTDT_MG_EA describing the end address of 
TXTDT_MG with the relative block number from the first 
byte of the TXTDT_MG. 

Although not shown, each IT_TXT_SRP contains 
information items of: 

(a) IT_TXT_SA describing the start address of 
IT TXT with the relative block number from the first 
byte of TXTDT__MG ; and 

(b) IT_TXT_SZ describing the size of IT_TXT in 
bytes . 

Note that IT_TXT describes an item text with the 
character code specified by the above-mentioned CHRS. 

The embodiment of the present invention has 
several functions and data structures, such as 
Representative Picture, Disc Representative Picture, 
Disc Representative Name, Resume Marker, and Primary 
Text Information to be used in a player menu (cf . 
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FIGS . 6A and 6B) . 

The resume marker keeps information about the 
position where the playback has been suspended by a 
stop operation. Utilizing this information, playback 
5 can be resumed from the exact point where it has been 

suspended, even if disc 100 has been once ejected from 
and inserted again to disc drive 409 in FIG. 14. 

Each of tracks and play lists or disc 100 itself 
may have a representative picture. These 

10 representative pictures may be used to help a user to 

identify the target track, play list, or disc. Any 
audio still video included in disc 100 can be defined 
as the representative picture for the track and/or play 
list and as the disc representative picture of that 

15 disc. 

Most of those functions and data structures are 
defined as optional, which means that a Recorder or 
Player may not have a capability of handling those 
functions or data 

20 FIG. 39 illustrates an example of usage of primary 

text information (e. g., PRM_TXT shown in FIG. 29). 

As shown in FIG. 39, the primary text may be 
described for tracks, play lists, and/or entry points 
for index, and may be used to identify them. Primary 

2 5 text information PRM_TXTI may be described in two kinds 

of character sets, i.e., ASCII and one or more other 
sets. The ASCII character set is supported as a common 
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character set for world wide use of disc 100. 

The example of texts shown by a track set or those 
shown by a monitor TV screen correspond to the text of 
track title 3 shown in FIG* 6A. 
5 In the example of FIG . 39 , a player gathers 

PRM_TXTI for all tracks #1 to #3 and makes a menu 
(displayed on the monitor TV screen shown in FIG. 39) 
so that a user can easily select a track to be played 
back. For instance, the primary text "Symphony No. 9" 

10 for track #1, "Piano Concert" for track #2, and "Chorus 

Festival" for track #3 can be presented in the menu. 
The user can select, e.g., the "Piano Concert" from the 
menu so that the contents of track #2 recorded on the 
disc are played back. 

15 Also disc 100 may have a disc representative name 

(e.g., my disc, sister's disc, brother's disc) which 
can be used to identify the disc (e.g., my disc). For 
instance, a set ( recorder /player ) which can handle 
multiple discs (discs for me, for sister, and for 

2 0 brother) inside it may utilize this information to 

identify the target disc (e.g., my disc). 

FIG. 40 is an explanatory view of a presentation 
of Audio and Audio Still Video (ASVOB). 

ASVOBs shown in FIG. 40 correspond to still 

25 pictures No. 1 to No. 9 shown in (a) of FIG. 7. The 

audio presentation block shown in FIG. 4 0 corresponds 
to the block of audio tracks No. 1 to No. 3 shown in 
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(b) of FIG. 7. 

Presentation of ASVs is defined with the entry 
point for a display list in the original PGC or user 
defined PGC, and audio still video objects (ASVOBs) are 
5 defined as presentation data* 

Audio still video unit ASVU is the collection of 
one or more (up to "99") ASVOBs which are presented 
while one or more cells (or tracks) are played back, 
and ASVU is pre-loaded into a memory (ASVU buffer) 
10 before starting the presentation of the cell(s). 

During the loading of ASVU, audio output may be muted 
by the player. 

One or more ASVUs can be recorded in disc 100 f and 
the size of each ASVU shall be equal to or less than 
15 2M bytes. 

Since all the ASVOB data in ASVU are kept in the 
ASVU buffer, a variety of presentation functionality 
are realized in relation to display order and display 
timing of ASVOBs . For instance, slideshow and 
20 brows able pictures can be realized by use of the ASVU 

buffer* The display timing of each of ASVOBs kept in 
the ASVU buffer can be freely determined by, e.g., main 
MPU 404 shown in FIG. 14. 

FIG, 41 shows an example of the structure of an 
25 original PGC (ORG_PGCI shown in FIG. 26). 

Programs #1, #2, ♦ in PGCI of FIG. 41 correspond 
to program information items 311 to 314 in (g) of 
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FIG. 7. Cells #1, #2, ... in PGCI of FIG. 41 
correspond to original cell information items 301 to 

305 in (f) of FIG. 7. AOBs #1, #2, in the 

AR_AUDIO.ARO file of FIG. 41 correspond to AOBs #1, 
5 #2, ... in (e) of FIG. 7. AR_AUDIO.ARO file in FIG . 41 

corresponds to 221 in (f) of FIG. 8. 

As shown in FIG. 41, the original PGC is formed of 
PGCI, one or more AOBIs, and one or more AOBs. PGCI 
consists of a chain of program, and each program 

10 consists of either one cell or a sequence of more than 

one cells. Therefore, the total presentation of the 
original 'PGC is a sequence of cell presentation. The 
cell presentation order is the same as the order in 
which cell information is described in PGCI. 

15 In order to enable presentation of each cell, cell 

information (CI) includes AOB number and presentation 
start time. The presentation start time shall be less 
than the presentation period of the first AOBU. 
Therefore, the presentation of AOB shall start from the 

2 0 top of AOB or from the middle of the first AOBU. 

In the original PGC, each cell refers to the 
presentation period of a whole AOB, unless any editing 
such as partial deleting occurs. After any editing, 
cell in the original PGC may not refer to a whole AOB. 

25 This is because the AOBU boundary does not give enough 

cutting resolution. Desired accuracy is at least the 
period of a coding block. But all AOBUs except for the 
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last AOBU in an AOB shall have the fixed presentation 
period. So, the AOBU which includes the cutting end 
portion remains in the new AOB, and the cell should 
refer to the segment of the AOBU. 
5 On the contrary, AOBU which includes the cutting 

start position needs not to remain as a whole AOBU. 
Therefore, the segment of AOBU after the cutting start 
position is deleted is deleted, and the segment before 
the cutting end position remains with the new AOB. 

10 In order to access an AOB recorded in a . ARO file, 

AOBI is used to obtain address information with respect 
to the file. There are two types of AOBI, one is for 
coding of constant bit-rate and the other is for coding 
of variable bit-rate. When coding is the constant bit- 

15 rate, the size of each AOBU except for the last AOBU is 

constant. When coding is the variable bit-rate, the 
size of each AOBU is different. The presentation 
period of each AOBU except for the last AOBU is fixed 
in case of both constant bit-rate and variable bit-rate. 

20 When an AOB is created, it is appended at the end 

of .ARO file and an associated cell, and possibly an 
associated program is appended at the end of PGCI. 

FIG. 42 shows an example of the structure of a 
user-defined PGC (UD_PGCIT shown in FIG. 1 or FIG. 26). 

25 Cells in PGCI #n of user defined PGC #n shown in 

FIG. 42 correspond to cell information CI in (d) of 
FIG. 7. AOBIs shown in FIG. 42 correspond to AOBI in 
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(e) of FIG. 7. 

As shown in FIG. 42, user defined PGC #n is 
created so that the cells in the PGC refer to AOBs in 
the original PGC. The user defined PGC #n has for 
5 cells (#1 to #4). Two of them refer to AOB #1 and 

other two refer to AOB #2 . The downward arrows from 
the cells in the user defined PGC to the original PGC 
illustrate the presentation periods for these cells. 
The order of cell presentation in the user defined PGC 

10 may become totally different from the presentation of 

the original PGC. 

FIG. 43 is a view for explaining an example of an 
entry point for the representative audio. 

Each of tracks can have a representative audio (cf . 

15 the second paragraph in the "information contents" of 

the table shown in FIG. 9A) which may be used to help a 
user to identify the target track same as representa- 
tive pictures. The representative audio for a track is 
one specified segment of the track. The number of 

2 0 representative audio for one track is "one" at maximum. 

In other words, each track may optionally have one 
representative audio . 

The start position and duration of representative 
audio for each track are described at an entry point 

25 (the upward arrow with {R} in FIG. 43) for represen- 

tative audio (cf. the second paragraph in the 
"information contents" of the table shown in FIG. 9A) . 
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The start position of representative audio corresponds 
to the timing information of this entry point. The 
entry point for representative audio has an additional 
information which specifies the duration of this 
5 representative audio. 

FIG. 43 shows an example of the entry point for 
representative audio in the original PGC. There are 
three programs PG #1, PG #2, and PG #3. Only PG #2 has 
the entry point {R} for representative audio. This 
10 entry point {R} is described at the same entry point 

table as the entry points for index and for display 
list* 

FIG. 44 shows an example of contents of cell entry 
point information (C_EPI of type D2; C_EPI# shown in 
15 FIG. 27 or (h) of FIG. 1). 

As shown in FIG. 44 , C__EPI of type D2 includes 
information items of : 

( 1 ) EP_TY describing a type of the entry point 
{this EP_TY is formed of EP_TY1 and EP_TY2 , and the 

2 0 tyP e °2 is indicated by a combination of the contents 

of EPJTY1 = "00b" and EP_TY2 = "lib".); 

(2) EP_PTM describing the presentation time of 
the entry point (this presentation time indicates the 
start time of representative audio); 

25 (3) RA_DUR describing the duration of 

representative audio. 

Incidentally , EP_TY corresponds to the entry point 
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described in the second paragraph of the "information 
contents" of the table shown in FIG. 9A. Further, 
RA_DUR corresponds to the range between the playback 
start time and the playback end time described in the 
second paragraph of the "information contents" of the 

table shown in FIG. 9A. 

FIG. 45 shows an example of contents of cell entry 
point information (C_EPI of type Bl; C_epi# shown in 
FIG. 27 or (h) of FIG. 1). 

As shown in FIG. 45, C_EPI of type Bl includes 

information items of: 

(1) EP_TY describing a type of the entry point 
(this EP_TY is formed of EP_TY1 and EP_TY2, and the 
type Bl is indicated by a combination of the contents 
of EP_TY1 = "01b" and EP_TY2 = "01b".); 

(2) EP_PTM describing the presentation time of 

the entry point; 

(3) IDXN describing the index number of the index 
point which is specified by the entry point; and 

(4) PRM_TXT describing the primary text for the 
entry point. 

Note that when the cell belongs to the original 
PGC and the entry point is the first one in the program 
( PG), IDXN shall be "1". When this cell belongs to the 
user defined PGC and this entry point has the entry 
point for user defined track, IDXN shall be "1". 

When the entry point does not satisfy the above 
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condition, IDXN of this entry point shall be IDXN of 
the previous type B entry point plus one (i.e., IDXN of 
this entry point is incremented by 1). 

Note that the previous entry point may be included 
5 in the previous cell. 

Also note that PRM_TXT corresponds to the primary 
text information PRM__TXTI described in the 5th 
paragraph of the "information contents" of the table 
shown in FIG. 9A. 
10 FIG. 4 6 shows an example of contents of cell entry 

point information (C_EPI of type B2 ; C_EPI# shown in 
FIG. 27 or (h) of FIG. 1). 

As shown in FIG. 46, C_EPI of type B2 includes 
information items of : 
15 (1) EP_TY describing a type of the entry point 

(this EP__TY is formed of EP_TY1 and EP_TY2 , and the 
type B2 is indicated by a combination of the contents 
of EP_TY1 = "00b" and EP_TY2 = "01b".); 

(2) EP_PTM describing the presentation time of 
20 the entry point; and 

(3) IDXN describing the index number of the index 
point which is specified by the entry point. 

The type B2 C_EPI is obtained by deleting PRM_TXT 
from type Bl. 

25 FIG. 4 7 shows an example of contents of cell entry 

point information (CJEPI of type C2; C__EPI# shown in 
FIG. 27 or (h) of FIG. 1). 
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As shown in FIG. 47, C_EPI of type C2 includes 
information items of: 

( 1 ) EP_TY describing a type of the entry point 
(this EP_TY is formed of EP_TY1 and EP__TY2 , and the 

5 type C2 is indicated by a combination of the contents 

of EP_TY1 = "00b" and EP_TY2 = "10b".)? 

(2) EP__PTM describing the presentation time of 
the entry point (the field of this EP_PTM is filled 
with "OOh" if the display timing mode of the cell is 

10 brows able); 

( 3 ) ASVOB_ENTN describing the entry number of 
ASVOB of the entry point (if the display order mode of 
the cell is random or shuffle, the field of ASVOB_ENTN 
is set to "00h" ) ; 

15 (4) HOME_DLISTN describing the entry point number 

which specifies the home display list (Home DLIST) in 
the cell (in every entry point of type C2 , the value of 
HOME__D LIS TN shall be identical; when there is no Home 
DLIST in this cell, "OOh" is entered to HOME_DLISTN; if 

2 0 the display timing mode of this cell is slideshow, 

"OOh" is entered; if the display order mode of this 
cell is random or shuffle, "00h" is entered? note that 
H OME_DL I S TN shall be equal to or less than the number 
of entry points in this cell); 

25 (5) S^EFFECT describing visual effect (such as 

cut-out/cut-in, fade-out/fade-in, dissolving, wiping, 
etc.) information when the display of the previous 
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ASVOB terminates and the display of this ASVOB starts; 

(6) E_EFFECT describing visual effect information 
when the display of the ASVOB and the presentation of 
the ASVU are terminated; 
5 (7) MAX_DUR describing the maximum duration to 

update ASVOBs measured in "Duration = MAX_DUR x video 
frame" where one video frame means 1/29.27 in case of 
NTSC and 1/25 in case of PAL (if the display timing 
mode of the cell is slideshow, the field of MAX_DUR is 
10 filled with n 00h"); and 

(8) MIN_DUR describing the minimum duration to 
update ASVOBs measured in "Duration = MIN_DUR X video 
frame" . 

Note that when the duration of MAX_DUR or MIN_DUR 
15 is infinite, the fields of MAX_DUR and MIN_DUR are 

filled with "OOh" . 

Note that when the maximum duration time and 
minimum duration time are the same, the duration time 
for updating ASV is always fixed. When the maximum 
20 duration time and minimum duration time are different, 

the duration time for updating ASV is randomly changed 
(by a player) between the maximum duration time and the 
minimum duration time. 

Note that the minimum duration time shall be more 
25 than 0.4 second. The maximum duration time shall be 

equal to or more than the minimum duration time. 

Also note that EP_TY of type C2 corresponds to the 
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entry point type information (EP_JTY) described in the 
first paragraph of the "information contents" with 
respect to "still picture entry points 21 to 26" shown 
in FIG. 9B. EP_PTM of type C2 corresponds to the 
5 information (EP_PTM) described in the third paragraph 

thereof. ASVOB__ENTN of type C2 corresponds to the 
information (ASVOB__ENTN) described in the second 
paragraph thereof. MAXJDUR and MIN_DUR of type C2 
correspond to the information (MAX_DUR & MIN__DUR) 
10 described in the 4th paragraph thereof. 

FIG. 48 shows an example of contents of PGC 
general information (PGC_GI shown in FIG. 2 7 or (g) of 
FIG. 1). 

As shown in FIG. 48, PGC_GI includes PG__Ns 
15 describing the number of programs; and CI_SRP_Ns 

describing the number of cell information search 
pointers CI_SRPs. 

Note that in case of a user defined PGC, PG_Ns 
shall be set to "0". The maximum number of programs 
20 for the original PGC is "99". The maximum number of 

CI_SRPs is "999". 

FIG. 4 9 shows an example of contents of program 
information (PGI# shown in FIG. 27). 

As shown in FIG. 49, PGI includes the information 
25 items of: 

(1) PG_TY describing the type of the program; 

(2) C_Ns describing the number of cells in the 
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program; 

( 3 ) PRM_TXTI describing the primary text 
information for the program; 

( 4 ) IT_TXT__SRPN describing the search pointer 
5 number of the item text whose text data corresponds to 

the program; and 

(5) REP_PICTI describing representative picture 
information (cf. FIG* 50). 

Note that PG_TY includes a protect flag. When 
10 this flag is "Ob", the program is not in a protected 

state. When this flag is "lb" , the program is in a 
protected state. 

When a program is in the protected state, all AOBs 
and ASVOBs referred to and utilized in the presentation 
15 of that program shall not be temporarily or permanently 

erased. 

The protect flag shall not be set to "lb" unless 

all AOBs and ASVOBs referred to and utilized by the 

program are in normal state. 
2 0 FIG. 50 shows an example of contents of 

representative picture information (REP_PICTI shown in 

FIG. 29 or FIG. 49) . 

As shown in FIG. 50, REPJPICTI includes 

information items of : 
25 (a) ASVUN describing ASVU number (e.g., #1 of 

ASVU shown in (f) of FIG. 10, FIG. 28A, etc.) in which 

the representative picture for a track exists; and 
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(b) ASVOB_ENTN describing the ASVOB_ENT number 
(e.g., #1 of ASV0B_ENT shown in (h) of FIG. 4, (g) of 
FIG. 12, etc.) in which the representative picture for 
the track exists. 
5 Incidentally, REP__PICTI corresponds to 41 to 44 in 

(d) of FIG. 12. ASVUN corresponds to 273 to 274 in (g) 
of FIG. 12. ASVOB_ENTN corresponds to any of 281 to 
296 in (g) of FIG. 12. 

The effects of the present invention mentioned 
10 above are summarized as follows. 

1. Since break information of audio tracks is 
provided to PGC (Program Chain) information that 
indicates the playback sequence of audio information in 
management information, the same data structure and 

15 data structure layers (hierarchical structure of 

PGC/ (Program) /cell) as those of the existing Video 
Recording specifications can be assured in the 
management information . 

As a result, high compatibility with the existing 

2 0 Video Recording specifications can be assured, and 

cross-reference between video information (movie 
object) and audio information can be made upon playback* 
Since break information of audio tracks is recorded in 
management information, recording, playback, and edit 

25 processes in units of audio tracks unique to audio 

information can be very easily done. 

2 . Since a program in original program chain 
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information 144 as management information that pertains 
to an original track corresponds to the original track, 
and "text information that pertains to an audio track", 
"representative picture indicating the audio track 
5 contents" , "display mode of still pictures", "erase 

inhibition flag", and the like, which are unique to 
each audio track are recorded in program information 
(311 to 314) as an information recording area that 
pertains to the program, various kinds of information 
10 of individual audio tracks can be flexibly controlled, 

and recording, playback, and edit processes in units of 
audio tracks can be very easily done. 

3. Since track head entry points 171 to 173 
indicating break information of audio tracks are 

15 recorded in cell information (164 to 169) in user- 

defined PGC information table 145 as management 
information that pertains to a play list, and various 
kinds of information such as "text information that 
pertains to an audio track", "representative picture 

20 indicating the audio track contents", "display mode of 

still pictures", "erase inhibition flag", and the like, 
which are unique to each audio track are provided to 
track head entry points 171 to 173, various kinds of 
information of individual audio tracks can be flexibly 

25 controlled, and recording, playback, and edit processes 

in units of audio tracks can be very easily done. 

4. When the user designates still pictures to be 
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displayed simultaneously with playback of an audio 
track, the display timings of individual still pictures 
upon playing back audio information are automatically 
computed on the basis of the playback time of the audio 
5 track and the number of designated still pictures, and 

are automatically recorded in management information. 
Hence, the display times of still pictures can be very 
easily set without any load on the user. 

5. Since an original list and play list are 

10 simultaneously displayed on the screen (which can be 

applied to video recording in addition to audio 
recording), the user can easily create the play list. 

6 . When a new track is created on the play list 
by collecting portions of original tracks in the 

15 original list or when the contents of an original track 

are partially erased, if one of: 

A] a process for using all still pictures 
displayed upon playing back the original track as those 
to be displayed upon playing back the new track on the 

20 pl a y list, or also displaying all still pictures 

displayed before partial erase after partial erase; and 

B] a process for using only still pictures 
falling within a specific range of those displayed upon 
playing back the original track as those to be 

2 5 displayed upon playing back the new track on the play 

list, or not displaying still pictures displayed within 
the partial erase range after partial erase 
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to be selected differs depending on edit machines 
(information recording/playback apparatuses ) , the user 
confuses. In addition, if an edit machine (information 
recording/playback apparatus) automatically selects one 
of processes (A) and (B) without notice of the user, 
the user feels disturbed. 

Since discrimination information indicating 
process (A) or (B) to be executed is displayed in the 
form of display mode 7 on the screen on which the user 
creates a play list, the user can understand the setup 
method of still pictures with respect to a new track, 
and can be prevented from feeling disturbed, and still 
picture setup consistency can be maintained 
independently of the models of edit machines 
(information recording/playback apparatuses). 

7. Since an arbitrary scene of a movie object is 
extracted as a still picture, and is registered in 
still picture AV file information table 142 as a still 
picture that can be displayed simultaneously with an 
audio object, an arbitrary scene of a movie object can 
be used as a still picture that can be displayed 
simultaneously with audio information. At the same 
time, since a plurality of pieces of still picture 
information can be recorded together in a given area, 
high-speed access to still pictures can be made, and 
still pictures and audio information can be 
continuously played back without being interrupted. 



8. In the present invention , still pictures are 
designated in units of tracks, and designation 
information of a representative picture that indicates 
the track contents is provided to the management 
information independently of designation information of 
still pictures to be displayed upon playback of an 
audio track. Hence, an arbitrary still picture in a 
VOB group at a location other than a video object (VOB) 
group that records still pictures to be displayed upon 
playing back an audio track can be set as a 
representative picture, thus improving the degree of 
freedom. 

9 . Since display range information of 
representative audio indicating the contents of a given 
audio track is stored in an area that records unique 
information in units of audio tracks, the user need 
only designate an audio track to be confirmed on the 
display window shown in FIGS* 6A and 6B, so as to 
confirm if the track he or she designated is the one he 
or she really wants to listen to, without listening to 
all audio tracks, thus allowing the user to very easily 
search audio tracks. 

Additional advantages and modifications will 
readily occur to those skilled in the art. Therefore, 
the invention in its broader aspects is not limited to 
the specific details and representative embodiments 
shown and described herein. Accordingly, various 



modifications may be made without departing from the 
spirit or scope of the general inventive concept as 
defined by the appended claims and their equivalents. 



